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Abstract 

This is the first of two papers devoted to connections between asymptotic func- 
tions of groups and computational complexity. One of the main results of this paper 
states that if for every m the first m digits of a real number a > 4 are computable 
in time < C2^ ™ for some constant C > then is equivalent ("big O") to the 
Dehn function of a finitely presented group. The smallest isodiametric function of 
this group is n^/'*". On the other hand if n° is equivalent to the Dehn function 
of a finitely presented group then the first m digits of a are computable in time 

< C2^ for some constant C. This implies that, say, functions n'^'^^, and 
for all rational numbers a > 4 are equivalent to the Dehn functions of some finitely 
presented group and that and for all rational numbers a > 3 are equivalent 
to the smallest isodiametric functions of finitely presented groups. 

Moreover we describe all Dehn functions of finitely presented groups >- as 
time functions of Turing machines modulo two conjectures: 

1. Every Dehn function is equivalent to a superadditive function. 

2. The square root of the time function of a Turing machine is equivalent to the 
time function of a Turing machine. 
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1 Preliminaries 

A function / : N ^ N is called an isoperimetric function of a finite presentation V = 
{X I i?) of a group G if for every number n and every word w over X which is equal to 1 
in G, \w\ < n, there exists a van Kampen diagram over V whose boundary label is w and 
area < /(n); in other words, w is a product of at most f{n) conjugates of the relators 



from R (see [El, M, 13 



A function is called an isodiametric function of a finite presentation V = {X \ R) if for 
every number n and every word w over X which is equal to 1 in G, |w| < n, there exists 
a van Kampen diagram over V whose boundary label is w and diameter < f{n); in other 
words, w is a product of conjugates x^^riXi of the relators rj from R and the lengths of 



the words Xi are bounded by f{n) (see @], |]TT]|). 

The smallest isoperimetric function of a finite presentation V is called the Dehn func- 
tion of V. 

Let /, 5f : N — > N be two functions. We write f ^ g if there exist non-negative 
constants a,b,c,d such that f{n) < ag{bn) + cn + d. All functions g{n) which we are 
considering in this paper grow at least as fast as n. In this case f{n) ^ g{n) if and only 
if /(^) ^ ag{bn) for some positive constants a, b. Two functions f,g are called equivalent 
ii f ^ g and g ^ f. 

It is well known that Dehn functions and the smallest isodiametric functions corre- 
sponding to different finite presentations of the same group are equivalent (see |21] or 
[0], [0). This allows us to speak about the Dehn function and the smallest isdiametric 
function of a finitely presented group. Sometimes we shall also say that a function f{n) 
is the Dehn function (the smallest isodiametric function) of group G when in fact f{x) is 
only equivalent to the Dehn function ( the smallest isodiametric function ) of a presentation 
of this group. 

We shall not distinguish between equivalent functions in this paper. Thus we do not 
distinguish, say, functions 2^ and 3", n" and [n"], 1 and 2n. But we distinguish n" and 
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n^, where a and j3 are different numbers greater than 1. We also distinguish, say n" and 
n° logn for a > 1. 

It is known that every function where > 1 is an integer is the Dehn function 
of some finitely presented group. Some other functions like 2" are also Dehn functions. 



Some information about the class of Dehn functions can be found in |21[], |T2[, 0,0, [0] 



[ pin and other papers. It is also known that not every reasonable function is the Dehn 
function of some finitely presented group. For example, as was shown by Gromov [[T^] , 
there are no Dehn functions which are strictly between n and rt^. Thus there exists a gap 



between n and n^. Information about isodiametric functions can be found in |TT], 0. 

Several people including Hamish Short asked whether there are other gaps. Answering 
another question of Short, M. Bridson proved that there are finitely presented groups with 
Dehn functions equivalent n" for some non- integral rational numbers a 0. 

It is worth mentioning that a related class of growth functions of finitely generated 
groups contains many gaps. For example, from a well known theorem by Gromov [|I^ if the 
growth function of a finitely generated group G is bounded from above by a polynomial 
then G has a nilpotent subgroup of finite index. This easily implies that its growth 
function is equivalent to for some natural number k. Thus there does not exist a 
finitely generated group with growth function equivalent to, say, n^'^ or ri^^^. 

In this paper, we shall prove that for every relatively fast computable number a > 4 
there exists a finitely presented group with Dehn function equivalent to n" and the smallest 
isodiametric function equivalent to n'^"/^. On the other hand we shall prove that if n"" is 
equivalent to the Dehn function of a finitely presented group then a is a relatively fast 
computable number (see Corollary |T]^) . For example, is a Dehn function of a finitely 
presented group if a > 4 is rational or if a = vr + 1, etc. 

We show that there exists a close relationship between Dehn functions and complexity 
functions of Turing machines. This paper is the first of two papers where we explore this 
relationship, the next paper will be joint with A. Yu. Ol'shanskii. One of the main results 
of the present paper (Theorem |1 . 1| ) says that every Dehn function is a time function of 
some (not necessarily deterministic) Turing machine. The class of time functions of Turing 
machines is very large. It contains all relatively fast computable functions (see below). 
So it is very surprising that, as the other result of this paper states, there is virtually 
no difference between the class of Dehn functions >- and the class of time functions 
>~ n'^. As an immediate corollary we get that the class of Dehn functions is very large. 
For example all functions of the form n"(logn)^ for a > 4 are Dehn functions of finitely 
presented groups. 

There is a conjecture that every isodiametric function is the space function of some 
Turing machine. Notice that it is not known that the set of space functions > n differs 
from the set of time functions > n. 

In order to formulate the main results of our paper, we need some more definitions. 

Let M be an arbitrary (deterministic or nondeterministic) Turing machine. For every 
natural number n let T{n) be the smallest number such that for every acceptable word w 
with |w| < n there exists a computation of length < T{n) which accepts w. Notice that 
since M may be nondeterministic, there could be several accepting computations for the 
same initial configuration. The function T(n) is called the time function of the Turing 
machine M. 
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We call a function / superadditive if for all natural numbers m, n we have f{m + n) > 
f{m) + f{n). We do not know any Dehn function of a finitely presented group which is 
not equivalent to a superadditive function and Sapir conjectures that there are no such 
Dehn functions. In ||15| Guba and Sapir proved that every free product G * H where G 



and H are non-trivial groups, has a superadditive Dehn function. For example, for every 
group G the free product G * Z has a superadditive Dehn function (here Z is the infinite 
cyclic group). 

The main results of our paper are the following. 

Theorem 1.1 Every Dehn function of a finitely presented group is equivalent to the 
time function of some (not necessarily deterministic) Turing machine. 



Theorem 1.2 Let T)^ he the set of all Dehn functions d{n) > n'^ of finitely presented 
groups. Let T4 be the set of time functions t{n) > of arbitrary Turing machines^. Let 
be the set of superadditive functions which are fourth powers of time functions. Then 



This theorem is a corollary of Theorem O] and the following result. 



Theorem 1.3 Let L C be a language accepted by a Turing machine M with a time 
function T{n) for which T{n)^ is superadditive. Then there exists a finitely presented group 
G{M) = (A) with Dehn function equivalent to T{n)^, the smallest isodiametric function 
equivalent to T^{n), and there exists an infective map K : X+ — * (A U A"^)^ such that 

1. \K{u) \ = 0{\u\) for every u G X+; 

2. u E L if and only if K{u) = 1 in G; 

3. K{u) is computable in time 0{\u\) by a deterministic Turing machine. 

It is not known whether coincides (up to equivalence) with the set of superadditive 
functions in 7^. It is quite possible that these sets coincide. If this were true and in 
addition all Dehn function were superadditive (we have discussed this conjecture above) 
then X>4 would be equal to the set of all superadditive functions from T^. This is why we 
said before that the class of Dehn functions >- n'^ and the class of time functions >- 
virtually coincide. 

Using Theorem |1.3| and a variant of the Aanderaa construction (see [|l| or ^Q\) one can 
give a structural description of groups with word problem solvable in polynomial time. 
Let G = {A) be a finitely generated subgroup of a finitely presented group H = {B) and 
A ^ B. Then we define the Dehn function dcnin) of G in H as the smallest function 
f{n) with the following property: for every number n and every word w over A which is 
equal to 1 in G, \w\ < n, there exists a van Kampen diagram over H whose boundary 

^Recall that we do not distinguish equivalent functions so T4 is actuaUy the set of functions which are 
equivalent to time functions of Turing machines. 
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label is w and area < /(n); in other words, w is a product of at most f{n) conjugates of 
the relators from H. 

It is well known and easy to see (0) that the word problem of a finitely presented 
group is solvable if and only if the Dehn function is recursive. Nevertheless the difference 
between the computational complexity of the word problem and the Dehn function can 
be quite large (see pT[| ). 

Notice that if d{n) is the Dehn function of a finitely presented group G then there 
exists a "trivial" (non-deterministic) machine which checks if a word is equal to 1 in G. 
This machine simply inserts relators of G into the word until the word becomes 1. The 
time function of this machine is equivalent to d{n) (for the precise description of the 
"trivial" machine and the proof that its time function is equivalent to d{n) see in the 
proof of Theorem |1.1| ) . 

Thus the word problem in a group G can be decidable by a very fast machine but 
at the same time the "trivial" machine can be very slow. The following corollary shows 
that we can always embed G into another finitely presented group H such that the trivial 
machine for H solves the word problem in G almost as fast as the fastest Turing machine 
can. 

Corollary 1.1 The word problem in a finitely generated group G is solvable in poly- 
nomial time by a non- deterministic Turing machine if and only if G is embeddable into 
a finitely presented group H such that the Dehn function of G in H is bounded by a 
polynomial. Moreover, for every function T{n) > n, if the word problem in a group G 
is solvable in time < T{n) by a non- deterministic Turing machine then G is embeddable 
into a finitely presented group H such that the Dehn function of G in H is bounded by 
0{T{nY). 

One can also view this corollary as a result about distortions of areas of words in 
subgroups of finitely presented groups. By an area of a word w = 1 in G we mean 
the smallest area of a van Kampen diagram with boundary label w. Then the function 
dcnin) shows how distorted the areas of elements in the subgroup G oi H are. Theorem 
|1.1| shows that the function dc^uin) cannot be lower than the computational complexity 
T(n) of the word problem for G. Corollary |1.1| shows that the distortion can reach as low 
as T(n)^. 

We are not going to prove Corollary ^Tl] in this paper because in the next paper (joint 
with A. Yu. Ol'shanskii) we shall prove the following more powerful result. 

Theorem 1.4 The word problem of a group is decidable in polynomial time if and 
only if this group can be embedded into a group with polynomial Dehn function. Moreover 
every group with the word problem solvable in time T{n) can be embedded into a group 
with isoperimetric function T{n)^. 



The following corollary of Theorem 1.3 was the first result about Dehn functions of 



groups proved by Rips. It was the start point of this work. 

Corollary 1.2 (Rips) There exists a finitely presented group with undecidable conju- 
gacy problem and Dehn function equivalent to rt" . 
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The proof differs from the original proof of Rips and is given in the last section of this 
paper. 



The next corollary shows how to use Theorems |1.2| and |1.3| in order to find many 
different Dehn functions and isodiametric functions. 

With every function f{n) one can associate two computational problems. It is not 
very easy to define computability of functions by non-deterministic machines. So when 
we talk about computability of functions, we restrict ourselves to deterministic (multi- 
tape) Turing machines (although one can replace them with non-deterministic machines 
if one gives a "right" definition of computability). 

Problem A. Given a natural number n written in binary, compute f{n) in binary. The 
size of n is the number of digits of n, that is [log2 n] -|- 1 . 

Problem B. Given a natural number n written in unary (as a sequence of I's), compute 
f{n) in unary. The size of n is n. 

It is easy to see that if f{n) is a function such that Problem B is solvable in time 
0{f{n)) then f{n) is equivalent to the time function of a deterministic Turing machine 
(a Turing machine which computes f{n)). 

Corollary 1.3 Let f{n) > be a superadditive function such that the binary repre- 



sentation of f{n) is computable in time 0{\ f{n)) by a Turing machine (i.e., Problem 



A is solvable in time f{n))). Then f{n) is equivalent to the Dehn function of a 
finitely presented group and the smallest isodiametric function of this group is equivalent 
to T^l\n). 

Remark. Note that the size of n in Problem A is O(logn). Thus if, say. Problem 
A for a function f{n) is solvable in polynomial time then the condition of the corollary 
holds and f{n) is the Dehn function of some finitely presented group and the smallest 
isodiametric function of this group is equivalent to T^^^{n). 



Proof. By Theorem |1.2| it is enough to show that g{n) = [\l f{n)\ is equivalent to the 
time function of a Turing machine. 

Consider the following Turing machine M. The input of M is a natural number n 
written in unary. The machine has three tapes, with n (in unary) initially written on tape 
1, and tapes 2 and 3 are empty. Machine M first computes the binary representation of 
n. 

The algorithm is well known: 

Until the number n written on tape 1 is 0, 

write the remainder of n modulo 2 on tape 2, [n/2\ in unary on tape 3, 
erase tape 1, 

copy [n/2] from tape 3 to tape 1 and erase tape 3 
end of cycle. 

This algorithm takes 0{n) steps. After this, M computes f{n) in binary. By assump- 



tion, this takes at most 0{d f{n)) = 0{g{n)) steps. 
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Then M computes g{n) = fin)] in binary. Note that for every number m written 
in binary the number [^^/rn\ (in binary) can be computed in 0((logm)^) steps by the 
ordinary bisection algorithm. So M can compute the binary representation of g{n) from 
the binary representation of /(n) in 0((log/(n))^) steps. 

Finally M computes g{n) in unary. The algorithm is opposite to the one presented 
above. Again we use three tapes with g{n) (in binary) initially written on tape 1, and 
with tapes 2 and 3 empty. 

Write 1 on tape 2; 

move the head one step right on tape 1; 

until an empty square on tape 1 is to the right of the head, repeat: 
copy the content of tape 2 twice on tape 3 and erase tape 2 
copy the content of tape 3 on tape 2 and erase tape 3 

if 1 is written to the right of the head on tape 1 then write one more 1 on tape 2 
move the head one step right on tape 1 
end of cycle 

This takes at most 0{g{n)) steps. Thus the whole algorithm is executed in time at 
most 0{n)+0 {g{n))+0 {{log f{n))^)+0{g{n)) = 0{g{n)). This algorithm solves Problem 
B for g{n). Therefore g{n) is the time function of a Turing machine. By Theorem |1.2| , 
g{nY is equivalent to the Dehn function of a finitely presented group. Since f{n) is 
equivalent to g{nY, we conclude that f{n) is equivalent to the Dehn function of a finitely 
presented group. □ 

This corollary allows one to construct finitely presented groups with "arbitrary weird" 
Dehn functions and smallest isodiametric functions. The following general fact follows 



from Corollary 1.3 



Corollary 1.4 For every real number a > 4 such that the first m digits of a can he 
computed in time ^ 2^™ (for every m ) the function [n"] is equivalent to the Dehn function 
of a finitely presented group and the smallest isodiametric function of this group is n^^^". 
On the other hand if is the Dehn function of a finitely presented group then for every 
m the first m digits of a can he computed deterministically in time ^ 2 . 

Proof. Notice that the function [n""] is equivalent to the function 2[°'['°S2"-]]_ xhe func- 
tion [log2 n] (i.e. the binary expression of the number of binary digits in n) is computable 
in time < 0((log2n)^) by an obvious algorithm: scan the number n from left to right on 
one tape and after each step add 1 to the number on the other tape. 

Since the first [log2(log2 n)] + 1 digits of a are computable in time 0{n), the function 
[a;[log2n]] is computable in time 0{n) < 0(n"/^). 

Notice also that Problem A for a function equivalent to 2"^ is solvable in time 0{m). 
Indeed, we can consider the unary expression of m as the binary expression of 2™^^ — 1 
and use the second algorithm in the proof of Corollary |1.3|. It remains to apply Corollary 



i.a 



Now suppose that is the Dehn function of a finitely presented group. Then by 
Theorem is equivalent to the time function T{n) of some (non-deterministic) 
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Turing machine M. We can assume that when M accepts, all its tapes are empty and 
that M has only one accept configuration cq. We can also assume that M has an input 
tape which has number 1. 

By Gromov's theorem, we can assume that a >2. Then for any n > 

ein" < T(n) < e^n'^ 

for some positive constants ei and €2- Let number no be such that 2" > log2 €2 for every 
n > uq. Let g = [a] + 1. 

Consider the following deterministic Turing machine M' which computes T{n). Let k 
be the number of tapes of A4 . Then M' will have A; + 3 tapes. Tape A; + 1 will contain 
an input number n in binary, tape k + 2 will contain T{n) when M' stops, tape /c + 3 
is auxiliary. In the input configuration, M' has number n written on tape k + 1 and all 
other tapes empty. First of all it types the accept configuration of the machine M on the 
first k tapes and computes (in binary) on tape A; + 3. Then it considers all possible 
computations of length < n'^ of the machine (commands of M are applied backwards) 
starting with the accept configuration cq of M . The number of such computations is at 
most r""* where r is the number of commands of M. For each of these computations C, it 
calculates the length T(C) and the final word W{C) written on the first tape. It writes 
this information on tape k -\-2 provided |1^(C)| < n and the final configuration of C 
is an input configuration of M. Thus it produces a sequence 5, consisting of numbers 
T(C) and words W{C). The words W{C) in this sequence are all words of length < nfl 
accepted by M. It is clear that the time to produce this sequence and the length of it 
does not exceed Dn^r"' < 2" for some positive constants D, d, r. Then for every word W 
occurring in «S, M' calculates the minimal t{W) of the corresponding numbers T{C) (for 
all computations C such that W — W{C)). Then it finds the maximal number among 
all tiWy^. This number is obviously equal to T{n). After that M' writes T(n) on tape 
k + 2, erases all other tapes and stops. It is clear that in order to compute T(n) given 
the sequence S, M' needs time at most \S\'^ < 2" ^ for some constant di. Thus M' is a 
deterministic machine which computes T{n) in time < 2""^^ for some constant ^2- 

Given this machine M', consider the following Turing machine M" which will calculate 
the first m digits of a (for every m). This machine has k + A tapes with tape A; + 4 being 
the input tape. It starts with number m in binary written on tape k + 4 and all other 
tapes empty. Then it calculates the number n — 2^""^"° and writes it on tape k + 1 
(using tape A; + 2 as an auxiliary tape and cleaninig it after n is computed). Then 
M" turns on the machine M' and produces T{n) on tape k + 2. Then it calculates 
p = [(log2 T{n) — log2 ei)/2""] and writes it on tape A; + 2. Notice that a log2 n + log2 ei < 
log2 T{n) < a log2 n + log2 £2- Therefore 

[q;2"^] <P<Q;2"^ + (log2e2)/2"°. 

Hence p = [q;2™] , so p is the number formed by the first m + k binary digits of a where 
k is the number of digits in a before the period. From the construction of M", it is clear 

that the time complexity of M" does not exceed 2 < 2 for some constant ^3. 

□ 
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Notice that rational numbers > 4 and fast computable irrational numbers like vr + 1, 
e^, evr and others satisfy the conditions of Corollary p. .41 (see P]). It would be interesting 
to find out whether the number 2 in the second statement of Corollary |1.4| can be 
decreased to 2^™. This would give a necessary and sufficient condition for 71°" to be the 
Dehn function of a finitely presented group. 

Corollary [1.4| implies that not every polynomially bounded increasing computable 
function > rt^ is the Dehn function of a finitely presented group, (just take a very slowly 
computable number a and consider the function n"). This answers a question of Gersten. 



The plan of the paper is the following. First we prove Theorem 1.1: every Dehn 



function of a finitely presented group is a time function of some Turing machine. This 



implies one inclusion in Theorem |L^. In order to prove Theorem |1.3| and the other 
inclusion in Theorem |1.2| we start with an arbitrary multitape nondeterministic Turing 
machine with time function T{n). Our goal is to simulate a Turing machine in a group. 
The mere fact that a Turing machine can be simulated by a group is not new. Papers by 
Novikov and Boone showed it long ago. The problem that we face in this paper is that 
we need the Dehn function of the group to be not much bigger than the time function 
of the Turing machine. In fact all known simulations of Turing machines in groups (see 
[|l^) lead to groups with exponential Dehn functions. The same is true if we use Minsky 



machines instead of arbitrary Turing machines ||T7 . 

In order to overcome this difficulty, we introduce the concept of an ^-machine. An 
^-machine is a kind of Turing machine with one tape and many heads (several heads can 
move at once). Heads are placed between cells on the tape, so we consider their states 
as letters when we talk about the word written on the tape. A typical command of an 
S'-machine is g — apb^^ where q and p are states of some head and a is a tape symbol. 
Notice that this command does not depend on the content of the tape, only on the state 
of the head. It can be applied any time when the state of the head is q. Thus usually 
S'-machines are extremely non-deterministic. The alphabet of S'-machine is divided into 
subsets Y and of the same size, so the words written on the tapes can be considered 
as group words. After every step of the machine, the word on the tape is automatically 
reduced. We do not consider reducing the word a separate step, it is a part of execution 
of a command. 

For every Turing machine M with time function T{n) we find an S-machine S = S{M) 
which works in time T{n)^ and recognizes almost the same language. 

Then we define a finitely presented group Gn{<S) where is a natural number. For 
A^ = 1 this presentation resembles the presentation of Boone's group (see [^) but it does 
not have the Baumslag-Solitar type relations xyx~^ = which make the Dehn function 
exponential. 

In fact the S'-machines are needed precisely to avoid such relations. 
We shall prove that (for A^ large enough), the Dehn function of G]^{S) is T{nY, the 
smallest isodiametric function is T^{n) and that G]\f{S) satisfies the conditions of Theorem 



i.a. 



In order to do that, with every admissible word ly of 5 we associate a word K{W) over 
the generators of W. It is relatively easy to show that if W is accepted by S then there 
exists a "standard" van Kampen diagram {disc) with boundary label K{W), so K{W) = 1 
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in GAr(iS). In order to compute the Dehn function and the smallest isodiametric function of 
Gn{S) we consider an arbitrary van Kampen diagram A over the presentation of Gn{S). 
We define a planar graph associated with this diagram, whose vertices are subdiscs of 
A. For N big enough the degree of each vertex of this graph is greater than 8, so small 
cancellation theory is applicable to this graph (see I^Of). Using small cancellation theory, 
we decompose A in some natural way into simpler subdiagrams which either do not have 
subdiscs or are discs themselves. This is like decomposing a snowman into small snow 
balls. Then we show that the total area of the diagrams without discs is bounded by 
and the sum of the lengths of boundaries of discs is bounded roughly by 0(|Vr|). The 
area of a disc with perimeter n is bounded by the area of the corresponding computation 
of iS, that is by a function equivalent to T(n)^ and its diameter is equivalent to T^{n). 
Using the fact that T'^ is superadditive, we conclude that the total area of the maximal 
discs inside A is bounded from above by a function equivalent to T{n)'^. This gives us 
the upper bound of the Dehn function. 

To get the lower bound we consider an arbitrary van Kampen diagram with boundary 
label K{W) where W is an admissible word for S. We show that in this case W is accepted 
by S and that the area (diameter) of this diagram cannot be smaller than a constant times 
the area (diameter) of a disc corresponding to an accepting computation for W. This gives 
us the lower bound for the Dehn function and also proves that K{W) = 1 implies that 
W is accepted by S. 

In the last section of the paper we prove Corollary |1.2| . 



2 History 

This paper has a long and complex history. Several versions of this paper containing 
gaps appeared as preprints during the last 2 years, and many people have these wrong 
versions. In this section, we try to explain what were the gaps and how these mistakes 
were fixed. Without it several main ideas of this paper would be unjustified. Another 
reason for writing this section was that different authors had different input in this paper. 

We began working together on Dehn functions of groups during Rips' visit to Lincoln 
in Summer 1994. About a year before that Birget formulated the "embedding conjecture" 
that every finitely presented group with word problem solvable in polynomial time can be 
embedded into a finitely presented group with polynomial Dehn function. By that time the 
semigroup analog of this conjecture was proved by Birget. It appeared later in preprint 
[ffl. In this preprint, Birget also proved semigroup versions of the main results of this 



paper about isoperimetric functions: analogs of Theorems 1.1, 1.2, 1.3. He also presented 



a sketch of a proof of our Theorem |1.1| for groups: every Dehn function of a finitely 
presented group is equivalent to the time function of some Turing machine (the complete 
proof presented here is due to Sapir). 

Rips' idea of proving the "embedding conjecture" was the following. Consider the 
classic Novikov-Boone-Higman-Aanderaa embedding of a finitely generated group G into 



a finitely presented group H |]2^. The goal was to show that the Dehn function of H 
is not much bigger than the time complexity of a machine M solving the word problem 



in G. This idea was explored before in Madlener and Otto [21| but the estimates of the 
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Dehn functions there were too rough, they dealt only with Dehn functions which are well 
above exponential. In fact the main problem is that the "classic" Novikov-Boone-Higman- 
Aanderaa construction produces groups with at least exponential Dehn function because 
of the relations of the Baumslag-Solitar form a~^xa = x^. 

Recall that the Novikov-Boone-Higman-Aanderaa embedding has two steps [^. In 
the first step one takes a Turing machine M and simulates it in a group B{M). This 
means that there exists a map K from the set of configurations of M to the set of words 
in B{M) such that K{c) = 1 in B{M) if and only if the configuration c is accepted by M. 
In the second step one uses the Turing machine M solving the word problem of the group 
G and several relatively simple HNN-extensions of B[M) to produce a finitely presented 
group H containing G. 

Rips' program was, first, by modifying the Novikov-Boone construction get a group 
B'{M) also simulating M with Dehn function polynomially related to the time function 
of M and, second, show that HNN-extensions of Aanderaa does not change the Dehn 
function too much. 

In order to compute the Dehn function of a group Birget and Rips wanted to use 
geometric methods used by Rips in his proof of Corollary |l]2| and in his "geometrization" 
of the proof of Novikov-Boone theorem. This method counts the number of bands and 
annuli of cells in a van Kampen diagram, see Section |^ and Lemma of this paper. 
Notice that a similar method was used in studying van Kampen diagrams over HNN- 
extensions by Miller and Schupp in 1973 p2| . 

In 1994, Birget and Rips decided that by removing the Baumslag-Solitar relations and 
modifying the Turing machine M instead, they can still simulate the machine M in a 
group B\M) without exponential blow up of the Dehn function. It also looked possible 
that the geometric methods will give not just an upper bound of the Dehn function of 
the group B'{M) but the Dehn function up to the equivalence. This would give almost a 
complete description of all Dehn functions of finitely presented groups as time functions 
of Turing machines with some reasonable restrictions. This would solve a Short's problem 
of finding groups with exotic Dehn function, which Sapir was working on at that time. 
So Sapir joined Birget and Rips. 

Trying to implement our program we constructed a modification B'{M) of B{M) and 
proved several geometric lemmas similar to those in Section |^ were proved. 

Unfortunately these lemmas gave only an upper bound of the Dehn function which 
was way above the lower bound that we could get. It was not even clear that the Dehn 
function of B'{M) is polynomial provided M has a polynomial time function. To compute 
the Dehn function up to equivalence, further modification of B'{M) (with many sectors) 
and more geometric ideas (the snowman decomposition) were introduced and implemented 



mainly by Sapir. This gave us results similar to Theorems |1.2| and of this paper only 
instead of exponent 4, we had exponent 3. The paper was written in February 1996 and 
Sapir put a link to it on his Web page. Many people downloaded the paper. 

Immediately after that we started working on the second step of our program, the 
Higman- Aanderaa construction. Birget and Sapir introduced a many sectors modification 
of the Aanderaa construction and proved that this construction gives a Higman-type 
embedding of G into a finitely presented group H which was constructed as a sequence of 
HNN-extensions of B'{M). But when we tried to prove that the Dehn function of H is not 
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much bigger than the Dehn function of B'{M), we discovered that one of the geometric 
lemmas (Lemma 5.11 in the 1996 version of our paper) proved among the first results in 
1994 was wrong. In the "proof of this lemma we used a property of the Turing machine 
M which we did not have. As a result not only the computation of the Dehn function 
of the group B'{M) was wrong but even the fact that B'{M) simulates M in the sense 
mentioned above proved to be incorrect. 

At first this problem seemed easy to fix by modifying the Turing machine M. During 
seven months after that at least 20 different modifications of M were introduced. Each one 
of them seemed very promising at the beginning but turned out to be not better than the 
original modification later. The hope of ever fixing the mistake almost faded when Sapir 
realized that although the group B'{M) does not simulate the Turing machine M, it still 
simulates a "machine" related to M. These machines were called S'-machines. After that 
we faced a relatively standard problem in the theory of algorithms. We needed to prove 
that S'-machines are polynomially equivalent to ordinary Turing machines. This turned 
out to be very non-trivial. Sapir's proof of this fact presented in this paper is probably the 
longest proof of equivalence of two computing devices in the theory of algorithms. After 
this proof was obtained, the geometric part of the paper, - both the snowman construction 
and the proof of the lower bound of the Dehn function, - needed modifications also. It 
was done by Sapir. He also proved Corollary |1.4| about Dehn functions of the form n". All 
results about isodiametric functions in this paper are formulated and proved by Sapir as 
well. Since the proofs of these results depend heavily on the technique used in dealing with 
Dehn functions, it was unreasonable to write a separate paper on isodiametric functions, 
so we included these results here. 

Several comments by Ol'shanskii simplified the geometric part of our paper. We are 
grateful to him for these comments. We are also grateful to Victor Guba whose comments 
helped in writing the section about S'-machines. 

Ol'shanskii used several ideas of the 1996 version of our paper in his paper about 



distortions of subgroups in finitely presented groups |^. He also used a many-sector 
variant of the Aanderaa embedding. During his visit to Lincoln in January 1997, he and 
Sapir realized that a combination of S'-machines and geometric ideas of this paper and 
ideas of p4| gives a proof of Birget's "embedding conjecture". This completed the work 
started in 1994. 



3 Turing Machines 

In this section we collect all information about Turing machines that we need in the proof 



of our main results. We also prove Theorem 1.1 



We shall use the following standard notation for Turing machines. A (multi-tape) 
Turing machine has k tapes and k heads. One can view it as a six-tuple 

M={X,r,Q,Q,s,,so) 

where X is the input alphabet, F is the tape alphabet {X ^ T), Q = \J Qi, i = 1, k is 
the set of states of the heads of the machine, G is a set of transitions (commands), si is 
the fc-vector of start states, sq is the fc-vector of accept states. 
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The number of tapes k of the machine is determined by G. We assume that in the 
normal situation the machine starts working with states of the heads forming the vector 
si, with the head placed at the right end of each tape, and accepts if it reaches the state 
vector sq. In general, the machine can be turned on in any configuration and turned off 
at any time. 

The leftmost square on every tape is always marked by a, the rightmost square is 
always marked by uj. 

The head is placed between two consecutive squares on the tape. When we talk about 
the word written on the tape, we do not include a, u, and the state of the head. We 
assume that the machine can insert or delete squares on the tape but only at the right 
end (just before the cj-sign). 

At every moment the head observes two squares on each tape. 

A configuration of a tape of a Turing machine is a word auqvuj where q is the current 
state of the head, u is the word to the left of the head and v is the word to the right of 
the head. 

A configuration U oi a, Turing machine is a fc-tuple 

(f/l,f/2,...,f/fc) 

where f/j is the configuration of tape i. The length \U\ of this configuration is the sum of 
lengths of the words Ui. 

An input configuration is a configuration where the word written on the first tape is in 
X"*", all other tapes are empty, the head observes the right marker tu, and the states form 
the start vector Si. An accept configuration is any configuration where the state vector is 
So, the accept vector of the machine. 

A transition of a Turing machine is determined by the states of the heads and some 
of the 2k letters observed by the heads. As a result of a transition we can replace some 
of these 2k letters by other letters, insert new squares in some of the tapes and move the 
head one square to the left (right) with respect to some of the tapes.0. 

For example in a one-tape machine every transition is of the following form: 

uqv — * u'q'v' 

where m, f , m', v' are letters or empty words. The only constraints are that no letter can 
be inserted to the left of a or to the right of uj and the end markers cannot be deleted. 
This command means that if the state of the head is g, u is written to the left of q and v 
is written to the right of q then the machine must replace m by m', g by q' and v hy v' . 
For a general fc-tape machine a command is a vector 

{Ui^Vi,...,Uk^Vu] 

where t/j — is a command of a 1-tape machine, the elementary commands are listed 
in the order of tape numbers. In order to execute this command, the machine checks if 
Ui is a subword of the configuration of tape z (z = 1, /c), and then replaces Ui hj Vi. 

^In different books, one can find different definitions of Turing mactiines, but all these definitions are 
equivalent: the machines recognize the same languages, and have equivalent complexity functions. 
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Notice that for every command {Ui Vi, ...,Uk Vk}, the vector {Vi Ui, ...,Vk ^ 
Uk} is also a command of a Turing machine. These two commands are called mutually 
inverse. 

A computation is a sequence of configurations wi,...,Wn such that for every i = 
1, ...,n — 1 the machine passes from Wi to Wj+i by applying one of the transitions from 
G. A configuration w is said to be accepted by a machine M if there exists at least one 
computation which starts with w and ends with an accept configuration. 

A word u G X* is said to be accepted by the machine if the corresponding input 
configuration is accepted. The set of all accepted words over the alphabet X is called the 
language accepted by the machine. 

Let C = (ci, Cg) be a computation of a machine M such that for every j = 1, ...,g — l 
the configuration Cj+i is obtained from cj by a transition rj from B. Then we call the 
word ri . . . rg_i the history of this computation. The number g will be called the time 
(duration) of the computation. Let Pi [i = l,...,g) be the sum of the lengths of the 
configurations of the tapes in the configuration q. Then the sum of all Pi will be called 
the area of computation and will be denoted by area(C). 

If the set of transitions 6 is divided into a number of disjoint parts UILi ®« then every 
computation C can be represented as a "concatenation" of computations 

C1C2.... (1) 

where each Ci is a computation of one of the Qj, the last configuration of Cj is the first 
configuration of Cj+i and Ci are maximal blocks of C where rules from exactly one part Cj 
are applied. We shall use such representations often when a program of a Turing machine 
(and, later, a program of an S'-machine) is divided into subroutines. 

With every Turing machine one can associate five functions: the time function T{n), 
the space function S{n), the generalized time function T'{n), the generalized space function 
S'{n), and the area function A{n). These functions will be called the complexity functions 
of the machine. 

For every natural number n the number T{n) is the minimal number p such that for 
every accepted input configuration w (with \w\ < n) there exists at least one accepting 
computation of length at most p. The number S{n) is the minimal natural number p such 
that for every accepted input configuration w (with \w\ < n) there exists at least one 
accepting computation which contains only configurations of length < p. The definitions 
of T'{n) and S'{n) are similar but we consider arbitrary accepted configurations w of 
length n, not just the input configurations as in the definitions of T{n) and S{n). It is 
clear that T{n) < T'{n) and S{n) < S'{n) and it is easy to give examples where the 
inequalities are strict. The area function A{n) is defined as the minimal number p such 
that for every accepted configuration w (with \w\ < n) there exists at least one accepting 
computation of area at most p. 

We do not only consider deterministic Turing machines, for example, we allow several 



transitions with the same left side. The following proof of Theorem |L1| shows that non- 
deterministic machines arise naturally when one works with the word problem. In this 
proof we associate with every group G a nondeterministic Turing machine M such that 
M accepts those and only those words which are equal to 1 in G, and the time function of 
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M is equivalent to the Dehn function of G. Actually the machine simulates the standard 
way of deriving relations of a group from defining relations. The nondeterminism of the 
machine reflects the fact that for every word there are usually many relations applicable 
to this word. Computations of the machine M correspond to derivations in the group 
G. Although an idea of the proof of this statement can be found in Birget [Q, here we 
present the first complete proof. 

Let us recall the statement of Theorem |1 . 1| . The Dehn function of a finitely presented 
group is equivalent to the time function of a two-tape Turing machine. The language 
accepted by this machine coincides with the set of words equal to 1 in the group. 



Proof of Theorem |1 . 1| . Let G = {X \ R) he a. finitely presented group. We assume 
that R is closed under taking inverses and cyclic shifts. Let U be the finite set of all pairs 
of words {u,v) from (X U X~^)* such that uv~^ is a relation from R. Consider a Turing 
machine M with alphabet X U two tapes and the following program. 

The program starts in state qi with the head at the right end on each tape, a word w 
written on the first tape and empty second tape. The head always stays next to the right 
marker during the work of our Turing machine. The alphabet is X U X~^. The program 
consists of the following operations. 

1 (move). If the word written on one of the tapes has letter a at the right end, the 
machine can erase the square containing this letter and insert a square with in it on 
the other tape. This operation requires 1 transition: {aqiu qiu, qiu a'^qiu} 

2 (substitution). If the word written on the first tape has suffix u and there exists a pair 
{u,v) from U then the machine can replace m by t;, by deleting the |m| squares on the first 
tape and inserting |f | squares on the first tape containing the word v. It is easy to see that 
this operation requires at most |m| + \v\ transitions of the forms {aqiU qju, qiU — qju}, 
{qiUJ aqju, qiU qju}. 

3 (reduction). If the word written on the first tape and the word written on the 
second tape have the same rightmost tape letter, then the machine can erase the squares 
containing these letters. This operation requires one transition of the form {aqiuj — * 
qiiv, aqiuj qioj} 

4 (accept). The machine accepts if both tapes are empty. 

It is easy to prove by induction that after any number of these operations the product 
of the word written on the first tape and the inverse of the word written on the second 
tape is equal to w in the group G. Therefore a word w is accepted by our machine only 
if it is equal to 1 in G. 

Conversely, if a word w is equal to 1 in G then there exist a derivation w ^ Wi — > 
... ^ 1 such that at each step we either insert /delete a word r G -R or insert /delete a 
subword of the form aa~^ where a G X U X~^. It is clear that there exists a computation 
of our Turing machine which simulates this derivation. Therefore w is accepted by the 
machine M. Thus the language accepted by M consists of all words which are equal to 1 
in G. 

Let us prove that the time function t{n) of the machine M is equivalent to the Dehn 
function d{n) of the group G. Let {auipiuj.avipioj), . . . .{aUmPm^^fy-VrnVm^) be an ac- 
cepting computation of our machine where ui = w, pi = qi, vi = Um = Vm = 1, Pm = qo, 
and for every i = 1, . . . , m — 1 the machine passes from the i-th configuration to the i + l- 
st configuration by executing one of the operations 1-4. Then there exists a derivation 
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—> u»i 1 of length at most m where at each step one of relations from R is 

applied. This implies that there exists a van Kampen diagram with at most m cells and 
boundary label w. Thus t{n) > d{n) (n = \w\). 

On the other hand suppose that there exists a van Kampen diagram A over the 
presentation of M with boundary label w. Using this diagram we shall construct an 
accepting computation of the machine M for the word w of length at most ddw]) + jwl. 

We shall call an edge of A a bridge if after removing this edge the graph A becomes a 
union of two connected components (it is clear that we won't get more than two connected 
components) . 

We shall describe a process of "deconstructing" A. For every i = 1, 2, we define a 
diagram Aj, a path ti consisting of bridges in Aj, a vertex Vi, a subdiagram A^ of Aj, and 
two words Xi and i/i forming a configuration 7j of the machine M. By induction on i we 
shall prove that for every i > 1: 

a) Vi is the terminal vertex of ti and initial vertex of A^, the initial vertices of ti and 
Aj coincide, t~^ is a subpath of 9(Ai). 

b) The path ti has no common edge with A^, ti consists of bridges of Aj. 

c) If i > 1 then the diagram Aj is a subdiagram of Aj_i. 

d) The path ti is simple. 

e) If ti is not empty then A^ is a connected component of the graph obtained from Aj 
by removing the last edge of ti. 

f ) If Aj has more than one vertex then either ti is not empty or A^ is not empty. 

g) If i > 1 then 7j is obtained from 7j_i by one of the 3 operations of the machine M 
(move, substitution, reduction). 

h) Ui — Lab(tj), Xiy~^ is the label of the contour of Aj. 

Let Ai = A, let Vi be the initial vertex of A, let ti be an empty path with the initial 
vertex t{t) and terminal vertex T{t) equal to Vi. Let A'^^ = Ai. Then |/j = 1 and Xi = w. 
This means that 71 is an initial configuration of the machine M. It is clear that properties 
a)-h) hold. 

Assume that for some i > 1 we have defined Aj, A^, ti (vj, Xi and yj are determined 
by Aj, A^ and ti). If Aj consists of one vertex then the process stops. So suppose that 
Aj has more than one vertex. 

1 (move). Suppose first that v is an initial vertex of a bridge e in A^ such that (tjc) 
is a subpath of 9(Aj). Then we set t'j+i = '''(e), ij+i = tiC. Since e is a bridge, removing 
this edge from A^ divides A' into two connected components. The connected component 
containing r(e) will be denoted by A^^^^. We also let Aj+i = Aj. Notice that the properties 
a)-f) hold for the quadruple (Aj+i, A^_,_^, Uj+i, tj+i). The only non-obvious property is d), 
but it follows from the fact that e e Aj and that ti has no common edges with Aj (by 
the induction hypothesis). The configuration 7j+i is obtained from 7j by a move. Indeed, 
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notice that the subpath p of d{A) labelled by Xi is not empty (otherwise ti would be a 
closed path, a contradiction with d) ). Therefore is the last edge of p, so the label 
of e is the last letter of w. As a result of the operation, edge is subtracted from p 
and edge e is added to ti. Thus (xj+i, j/j+i) is obtained from {xi,yi) by removing the last 
letter of Xi and adding the inverse of this letter to Ui. This is the move operation. Thus 
properties g) and h) also hold. 

2 (substitution). Now suppose that Vi is not an initial vertex of any bridge e in 
such that (tjc)"^ is a subpath of d{Ai) and suppose that there is an edge e in with 
initial vertex Vi and a cell vr containing e. Let Aj+i (resp. A'^^^ ) be the result of removing 
e from Aj (resp. A^). We also let fj+i = Vi, tj+i = ti. It is clear that properties a)-f) 
hold for the new triple (Aj+i, A^^^, tj+i). The path 9(Aj+i) is obtained from 9(Aj) by 
replacing e with the rest of the boundary of the cell vr. Thus Xj+i is obtained from Xi 
by replacing the last letter Lab(e) with a word r such that Lab(e)r~^ is a relation in R. 
Thus 7j+i is obtained from ji by a substitution. Therefore properties g) and h) hold. 

3 (reduction). Finally there exists a possibility that A^ = {f,}. By e), since Aj 7^ {vi}, 
ti is not empty. Let e be the last edge of t^. By e) Vi is a connected component of Aj\{e}. 
This means that the boundary of Aj has a subpath ee~^ or e~^e. Then by removing the 
edge e and the vertex Vi from Aj we obtain a new diagram which will be denoted by Aj+i. 
Let fj+i be the initial vertex of e and let tj+i be the result of removing e from ti. Now if 
ti+i is empty then let A^^^ = Aj+i. If ti+i is not empty then let e' be the last edge of tj+i. 
Since e' was a bridge in the diagram Aj, it is a bridge in the subdiagram Aj^^. Therefore 
by removing e' from Aj+i we get a graph with two connected components. The connected 
component containing t>j+i is denoted by A'-^^. Again it is clear that all properties a)-f) 
hold for the new triple (Aj+i, Aj^^, tj+i). By assumption Xj and i/i have a common last 
letter Lab(e). The words Xj+i and ?/j+i are obtained from Xj and t/i by removing this 
letter. This is a reduction. Therefore properties g) and h) hold. 

Let E be the number of edges in A. Let us prove that by using at most 2E moves, 
substitutions and reductions, we can reduce A to a one-vertex diagram. Indeed, at every 
step we either increase the path ti by one edge or delete at least one edge from the diagram. 
All these edges belong to the original diagram A. Since tj is a simple path the number 
of edges of A which belong to at least one of the tj's does not exceed E. The number of 
reductions and substitutions also cannot exceed E. Therefore after at most 2E steps the 
process must end. But the process ends only when Aj consists of one vertex. Therefore 
after at most 2E steps we reduce A to a one-vertex diagram. 

Recall that the triple {Ai, A[,ti) corresponds to the initial configuration 71 where 
w = Lab(5(A)) is written on the first tape and the second tape is empty. By property h) 
for every i > 1 the configuration 7j+i is obtained from 7j by one of the three operations 
(move, substitution, reduction). Since in the last triple Aj is a one vertex diagram, 
the last configuration has empty tapes, i.e. it is the accept configuration. The number of 
operations needed to transform 71 into the accept configuration is the same as the number 
of operations needed to "deconstruct" A, that is at most 2E. Since each operation of 
M translates into a constant number of elementary transitions, there exists an accepting 
computation of M starting with 71 whose length does not exceed a constant times E. It is 
well known pO[ that in any diagram A over a finite presentation the number of edges does 
not exceed the length of d{A) plus a constant multiple of the number of cells. Therefore 
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the time function t{n) of M does not exceed a constant times n + d{n), where d{n) is the 
Dehn function of G. It remains to notice that d{n) is equivalent to C{d{n) + n), for any 
constant C. The theorem is proved. □ 

Notice that the machine that we constructed in this proof has two tapes and it has no 
commands which move the head away from the endmarker uo. As we shall see later every 
Turing machine can be converted into a Turing machine recognizing the same language, 
having the same complexity functions, and having no head moving commands. On the 
other hand, it is easy to construct a one tape machine which is equivalent (in the above 
sense) to the machine constructed in this proof. One has to just concatenate the first tape 
of our machine with the symmetric image of the second tape (of course, one would also 
to remove the right endmarker of the first tape, the left endmarker on the second tape 
and identify the state letters on the two tapes). Thus every Dehn function of a finitely 
presented group is equivalent to the time function of a one tape Turing machine. 

In the next section we will show how to convert any Turing machine to an jS'-machine 
The following lemma is a small preliminary step in this conversion. 

Lemma 3.1 For every Turing machine M recognizing a language L there exists a 
Turing machine M' with the following properties. 

1. The language recognized by M' is L. 

2. M' is symmetric, that is, with every command U ^ V it contains the inverse 
command V —>■ U. 

3. The time, generalized time, space and generalized space functions of M' are equiva- 
lent to the time function of M . The area function of M' is equivalent to the square 
of the time function of M. 

4. The machine accepts only when all tapes are empty. 

5. Every command of M' or its inverse has one of the following forms for some i 

{qiu q[u;, qi^iU ql_^u;, aqiU ((^u), q^^^LO ^-^lO;, ...} (2) 

{qxu q[uj, qi_iu g^'.^o;, aq^u aq^^u, qi+iU q[^^-^u, ...} (3) 

where a belongs to the tape alphabet of tape i, and qj, q'j are state letters of tape j. 
Thus if the head observes the right markers at the beginning of a computation, it 
will observe the right markers during the whole computation. If the head does not 
observe the right markers at the beginning, then no command is applicable, and so 
the computation is trivial. 

6. The letters used on different tapes are from disjoint alphabets. This includes the 
state letters. 

Proof. First we show how to construct a machine M' satisfying the first four proper- 
ties. Then we shall show how to transform it into a machine satisfying all six properties. 
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The symmetrization of an arbitrary Turing machine is essentially well known (see 0]). 
We modify the construction from here. 

Let M be a /c-tape Turing machine. We can assume without loss of generality that 
the first tape of M is the input tape, that is it can contain letters only from the input 
alphabet. If such a tape does not exist, we can always add it to the machine without 
changing the complexity functions or the language accepted by the machine. 

Recall that in an input configuration of M, a word is written on the first tape, all 
other tapes are empty, the head is next to the right endmarker on each tape. 

We can also assume that the machine M has only one accept command, that is a 
command in which the states in the right sides form the vector sq. 

The machine M' has k + 1 tapes and is a composition of three machines which we call 
phases 1, 2 and 3. The tape alphabet of the k + 1-st tape is the set of commands G of 
the machine M. The tape alphabets of the other tapes are the same as in M. The input 
alphabet of M' coincides with the input alphabet of M. 

In the first phase M' inserts a sequence of squares containing commands of machine 
M on tape k + 1. The only constrain is that the first inserted square must contain the 
accept command. 

Thus after the first phase, the machine M' has a sequence of commands of M written 
on the k + 1-st tape. Then the machine checks if all tapes except tapes 1 and k + 1 are 
empty, and proceeds to the second phase. This can be done by one transition of the form 

[qiuo q[uj, aq2UJ acl^i^, aq^^u) aq'^^uj, quj q'uj}. (4) 

In the second phase M' tries to execute on the first k tapes the sequence of commands 
written on tape + 1, reading them from the right to the left. In order to make the 
machine M' do this, we replace every command 

of the machine M by the command 

r' ={U^^Vi,...,Uu^ Vk, rq ^ qr} 

where q is the state of M' on tape k + 1 (this state is not changed by any of the commands 
of M'). This command executes the transition r on the first k tapes of M', checks if r is 
written next to the left of the head on tape k + 1 (if r is not written there, the command 
t' is not executed), and moves the head on tape k + 1 one square to the left. 

The second phase of the machine M' is deterministic. It is also injective, that is for 
every configuration of phase 2 there is at most one command of M' whose inverse is 
applicable to this configuration. Indeed, this command is determined by the letter on 
tape k + 1 which is written next to q on the right. 

If (and only if) it turns out that the sequence of commands written on tape k + 1 
during the first phase is a valid history of an accepting computation of machine M, that 
is if the head moves next to the left endmarker on tape k + 1, the machine M' returns 
the head to the right endmarker on tape k + 1 and passes to the third phase. 

In the third phase the machine erases one by one all squares on all tapes and accepts. 
The third phase of M' is deterministic. 
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Let Sym(M') be the machine obtained by symmetrizing M', that is by adding the 
inverses to all commands of M'. Consider an arbitrary reduced computation of Sym(M'). 
Since the second and the third phases of M' are deterministic and the first and the second 
phases are injective, every reduced computation of Sym(M') can be represented in the 
form (0): 

^ ~ "-"3,l'-"2,l'-"l,l'-^l,2'-^2,2'-'3,2'-^3,3'-^2,3---- 

where Cij is a computation of phase i. Some prefix and some suffix of this sequence 
may be empty, the other blocks are not empty. Now suppose that this computation is 
accepting. Then it contains a block Cf] for some j. Let £ be the length of the first 

-1-1 

configuration c in C^ j . One can apply to the configuration c commands from phase 3 of 
M' and get to an accept configuration in fewer than £ steps because each step on phase 3 
removes one square of the tapes of M' . Let be the corresponding computation. It is 
clear that the suffix of the computation C after the configuration c has length not smaller 
than the length of C'^^y The area and the space of this suffix are also not smaller than 
the area and space of C3 So we can replace this suffix of the computation C by C3 As 
a result we shall get a computation with no greater length, area and space. Thus we can 
assume that C has only one block of the form Cf^ and of course (since C is accepting) it 
should have the form Caj. This means that C has the form 

*-"2,l *-"l,2<-'2,2t^3,2 

where the first 3 blocks may be empty. Let I be the length of the first configuration c in 
C. 

If the blocks 6*2,1, Ci,i, Ci,2 are empty then the length of the computation cannot 
exceed li because the length of the block 6*2,2 does not exceed the length of the word 
written on tape + 1 in c, and the length of 63,2 does not exceed i. The space of this 
computation does not exceed i because the length of configurations does not get bigger 
during this computation. For the same reason the area does not exceed 0(J?\ 

Suppose that Ci,2 is not empty and let c be the first configuration of Ci,2. Then we 
can apply inverses of the commands in phase 1 and get to the input configuration. Indeed, 
tapes 2, 3, A; in c must be empty, and the head must observe the right endmarker on 
each tape: otherwise the machine would not be able to execute the command and 
pass from Ci,2 to C2,2- Also since the machine passes from 62,2 to the third phase, the 
first (from the left) square on tape + 1 must contain the accept command during the 
computation 6*2,2 • Since the content of tape + 1 does not change during the work of the 
second phase, the accept command must occupy the first square of tape + 1 at the end 
of the computation 61,2. Therefore by applying inverses of the commands in phase 1, M' 
can erase all squares on tape + 1 and get to the input configuration of M' . 

Let (6^2)"^ be the corresponding computation. Then 

2,l*-'l,ll'-'l,2j L'l,2'-^l,2<-'2,2<-'3,2 

is an accepting computation. The length and the space of the prefix 62^^6^1(6^ 2)"^ is 
0(£) and the area of this prefix is 0(£^). The first configuration c of 6{ 2 is an input 
configuration of M' whose length is < I. Since M' accepts this configuration, the 
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sequence s of commands written on the k + 1-st tape after Ci,2 is a history of accepting 
computation of the machine M. Therefore M accepts the word written on the first tape 
in c. Let m be the length of this history word s. From the description of M', it is clear 
that then the length of C is at most 0{i)+0{m), the space is at most m + £ and the 
area at most 0(m^ + Therefore if we replace s by the history word of a shortest 
computation of the machine M accepting the input word of configuration c, the length, 
space and area of the corresponding computation of M' cannot be greater then the length, 
space and area of C. Thus we can assume that m = T{i'), where T is the time function 
of the machine M. This implies that the length, space and area of C is, respectively, 
0(T(f ) + £), 0(T(f ) + £), 0{T\i') + f). Since £' < £, we can conclude that the time 
function, the generalized time function, the space function and the generalized function 
of M' are equivalent to the function T, and the area function of M' is equivalent to T^. 

We have also proved that every word accepted by Sym(M') is also accepted by M. 
The converse statement is obvious (in fact every word accepted by M is also accepted by 
M'). This proves that Sym(M') satisfies properties 1, 2, 3, 4 of the lemma. 

Thus we can assume that the original machine M satisfies properties 1-4. 

In order to get property 5, we divide every tape i of M into two tapes, and number 
them i and i + 1/2. If a configuration of the old tape i is auqvu then the configurations 
of the new tapes i and i + 1/2 will be, respectively: 

auquj 

and 

avquj 

where v is the word v written from right to left. Then we replace every command 

{uiqivi u\q!^v[, ...,UkqkVk ^ uWM 

by the command 

{uiqiu u[q[uj,viqiu v[q[uj, ....UkquUJ u'l^q^uJ^VkqkUJ v'/^qki^} 

Here v is either f , if f is not the right marker, or a if v = u. This vector of commands has 
2k components listed in the order of the tape numbers (1, 3/2, 2, 5/2,...). It is clear that 
this machine recognizes the same language as M and has the same complexity functions. 
Now one can replace each command 

{wiqiUJ w[q[uj, ....} 

by a sequence of 2k commands of the form 

{qiuj q[uj, WiqiUJ w-qiUJ, ...} (5) 

(only the i-th component of this command has a non-state letter different from ou). This, 
of course, requires increasing the the set of states (each of these new commands uses 
new state letters). The application of each of these commands may change the word 
written on only one tape. Together these commands do what the initial command does. 



21 



This transformation does not change the language accepted by the machine, it does not 
decrease the complexity functions and can increase them not more than by a constant 
factor (< 2k). Thus we can assume that every command of M has the form 

Notice that if Wi = a then the command (^) is of the form @ because w'^ is also equal 
to a. Otherwise we can replace this command by two commands 

{qiu q'luj, ...,WiqiUJ q"uj, ...} 

and 

{q'lio q[uj, q-uj w'^qluj, ...} 

where q'J are new state letters (which we add to the set of state letters) . Notice that these 
commands have the desired form (H). It is clear that the new machine recognizes the 
same language and each complexity function can increase by a factor of at most 2. All of 
the commands of the new machine have the desired form or 

The last property is easy to get. To get property 6, one just needs to replace the 
alphabet of each tape by a disjoint copy of this alphabet, and the set of state letters on 
each tape by a disjoint copy of this set. Then of course one needs to change the commands 
of the machine accordingly. □. 



4 /S-machines 

In this section, we define S-machines and prove that ^-machines are "polynomially'' 
equivalent to multi-tape Turing machines. 

We shall define S-machines as rewriting systems ||r^. Let n be a natural number. 



A hardware of an ^-machine is a pair (F, Q) where Y is an n- vector of (not necessarily 
disjoint) sets Yi, Q is a (n + l)-vector of disjoint sets Qi, \JQi and [JYi are also disjoint. 
The elements of U Yi are called tape letters, the elements of U Qi are called state letters. 

With every hardware S = {Y, Q) we associate the language of admissible words 
L(iS) = QiF(Yi)Q2...F{Yn)Qn+i where F(Yj) is the language of reduced group words 
in the alphabet Yj Ul^"^. This language completely determines the hardware. So instead 
of describing the hardware S, one can describe the language of admissible words (which 
is in many cases more convenient). 

If0<i<j<n and W = qiUiq2...Unqn+i is an admissible word then the subword 
qiUi...qj of W is called the {Qi, (5j)-subword of W {i < j). 

An S'-machine with hardware 5 is a rewriting systems. The objects of this rewriting 
system are all admissible words. 

The rewriting rules, or S-rules, have the following form: 

[Ui^Vi,...,U^^Vm] 

where the following conditions hold: 

Each Ui is a subword of an admissible word starting with a Q^-letter and ending with a 
Qr-letter (where £ = must not exceed r = r{i), of course). 

If i < i then r{i) < 
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Each Vi is also a subword of an admissible word whose Q- letters belong to Qe(^i)U...UQr(i) 
and which contains a Q^-letter and a Qr-letter. 

If = 1 then Vi must start with a Qi-letter and if r(m) = n + 1 then Vn must end 
with a Qn+i-letter (so tape letters are not inserted to the left of Qi-letters and to 
the right of (5„+i-letters). 

To apply an S'-rule to a word W means to replace simultaneously subwords Ui by 
subwords Vi, i = 1, ...,m. In particular, this means that our rule is not applicable if one 
of the Ui^s is not a subword of W. The following convention is important: 

After every application of a rewriting rule, the word is automatically re- 
duced. We do not consider reducing of an admissible word a separate step of 
an ^-machine. 

For example, if a word is 

qiaaq2bq3ccq4 

and qi G Qi, a ^ Yi, b E Y2, c E Y3 and the S'-rule is 

[qi pia~^, q2bq3 a~'^p2b'q3c], (6) 
where pi G Qi, P2 G Q2, b' G 1^2, then the result of the application of this rule is 

P\P2bl qzcccq^. 

With every S'-rule r we associate the inverse S'-rule in the following way: if 
r = [[/i XxV'iVx-, U2 -> X2V2y2, f/„ ^ XnY^y^] 
where starts with a (5£(j)-letter and ends with a Qr{j)-letter, then 

^-1 = [v; x-,^U,y^\ X2 ^t/2Z/2~\ K ^ x-'Unyn\ 

For example, the inverse of the rule (P) is 

[pi qia,p2b'q3 aq2bq^c'\ 

It is clear that is an S'-rule, (t^^)^^ = r, and that rules r and cancel each 
other (meaning that if we apply r and then r^^, we return to the original word). 
The following convention is also important: 

We always assume that an S'-machine is symmetric, that is if an S-machine 
contains a rewriting rule r, it also contains the rule r^. 

As in the case of Turing machines, we can define the history of a computation of an 
S-machine as the sequence (word) of rules used in this computation. A computation is 
called reduced if the history of this computation is reduced, that is if two mutually inverse 
rules are never applied next to each other. 

As usual the length of a computation Wi,...,Wn is n, the space is max{|iyj| | i = 
1, n}, and the area is J2i\^i\- 

We say that a computation of an S'-machine is proper if no new negative letters appear 
in a word during this computation. We say that a computation is semiproper provided a 
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new negative letter inserted during one step of the computation never disappears during 
the rest of the computation. 

It is easy to see that if a computation 

C={W^,W2,...,Wn) 

is semiproper then the inverse computation 

C-' = {Wn,...,W2,W^) 

is also semiproper. 

For example, let /c = 2, Yi = {5} = Y2, Qi = {a}, Q2 = {gi,go}, Qs = {^^}- Suppose 
further that the S'-machine S has just four rules: 

(1) qi a'^qia, 

(2) aqi ago, 

and their inverses. 

Here is an example of a proper computation of this S'-machine: 



aaaaqiaau — > aaaqiaaau aaqiaaaau aqiaaaaau — >■ aqoaaaaau. 

The history of this computation is (1)(1)(1)(2). 

Here is an example of a semiproper computation: 

aqiu — > aa~^qiauj aa^^ qiaau . 

The history of this computation is (1)^^(1)^^. It is also possible to prove that every 
reduced computation of this machine is semiproper. 

There exists a "natural" way to convert a Turing machine M into an S'-machine 



S. The idea is simple. Take a Turing machine satisfying all conditions of Lemma ^TT 
Concatenate all tapes of the machine M together and replace every command aqu —>■ q'uj 
by quj — a~^q'u, so that commands of M become S'-rules. Unfortunately the S'-machine 
S constructed this way does not inherit most of the properties of the original machine M. 

Consider the following typical example. Let M be a one tape Turing machine with 
X = Y = {a}, Q = {qi, q2, go} and the following commands: 

1. aqiu q2U! 

2. aq2UJ — > q2UJ 

3. aq2UJ aqou 

plus all inverses of these commands. It is obvious that this machine accepts the word 
a" if and only if n is strictly positive. The corresponding S'-machine has the following 
commands: 



1. qiuj a q2UJ 
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2. q2UJ a ^q20J 

3. aq2UJ aq2UJ 

and all inverse commands. This machine accepts the word aa"'qiuj for every integer n. 
For example, let n = — 1. Then the computation is the following (we write the number of 
a command in parentheses; denotes the inverse of command number i): 

aa'^qiuj (1) 
aa~^a^^q2UJ —>■ {2)^'^ 
aa~^q2UJ (2"^) 
aq20J (3) 
aqouj 

In fact it is nontrivial to construct an ^-machine which recognizes only positive powers 
of a. Such a machine will be the key block in the main construction of this section. 

Let S be an S-machine, let W be and admissible word and let Wi,...,Wk be some 
words. Then S{W,Wi, w^) is the statement saying that there exists a computation of 
S starting with W and ending with a word containing wi,...,Wk- The set of all such 
reduced computations will be denoted by CS(W,Wi,...,Wk). If this set has only one 
element, this element will also be denoted by CS(W,wi, ...,Wk). If C is a computation 
then tC is the last word of C. 

We shall need the following two general results. They have a "diagram group nature" 

H- 

Lemma 4.1 Let S be an S -machine, W and W be admissible words and let W be 
positive. Suppose that there exists a computation ofS connecting W and W . Then if every 
reduced computation starting at W is semiproper then every reduced computation starting 
at W is semiproper. In addition, suppose that every reduced computation C starting at W 
has length < f{\W\ + \tC\) and area < g{\W\ + \tC\) for some functions f and g. Then 
every reduced computation starting with W has length < f{\W\ + \tC\) + f{\W\ + \ W'\) 
and area < g{\W\ + \tC\) + g{\W\ + \W'\). 

Proof. Let C be a reduced computation connecting W and W and let C be a reduced 
computation starting with W'. Consider the concatenation of computations CC. Let us 
represent C in the form CiD and C in the form D^^C[ where the computation CiC[ is 
reduced. Then the computation CiC[ is semiproper since it starts at W. 

Suppose that C is not semiproper. Then a negative letter inserted during a step of C 
is deleted during a later step of C. These two steps cannot both occur in C[ because then 
CiC[ would not be semiproper. Thus the insertion occurs in D^^. But then D contains 
a step removing a negative letter. Since D is a suffix of the computation C = CiD and 

is a positive word, this negative letter should have been inserted and removed during 
the computation C. Thus C is not semiproper, a contradiction. 

The facts about the length and the area can be proved similarly. Indeed, the length 
(area) of C does not exceed the sum of the lengths (areas) of C and the reduced form of 
the computation CC . □ 
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Lemma 4.2 Let S be an S-machine and W and W he admissible words. Suppose that 
there exists a computation connecting W and W, W contains suhwords wi,...,Wk and 
CS(W,wi,...,Wk) contains only one computation. Then CS{W' ,wi, ...,Wk) contains only 
one (trivial) computation. 

Proof. Indeed, suppose that there exists a non-trivial reduced computation C from 
CS{W' ,Wi, ...,Wk). Let C be the reduced computation connecting W and W. Then the 
reduced form of the computation CC is equal to C. Therefore the reduced form of C is 
trivial, a contradiction (we assumed that C" is reduced and non-trivial). □ 

We shall construct eleven S-machines Si - Sg, Sa, S^j one by one. Then we'll use these 
machines in order to construct an ^'-machine S{M) simulating a Turing machine M. 

The machine <Si has the following hardware: 1^(1) = ({'^j; {'^}) {f^}) {^D; Q{^) ~ 
{{Pi,P2,P3}, {Qi, ?2, 53}, {ri, r2, ra}, {si, S2, S3}, {^i, ^2, ^3})- The admissible words have the 
following form: 

p6'''q6''^rS'''s6''H 

where p, q, r, s, t may have indices 1, 2 or 3. 

The program P(l) consists of the following rules and their inverses. 

2- [pigi ^2^2, ri r2, Si S2, ti ta] 
3. \pidqi PsSqs, ri rg, Si S3, ti ts] 

Lemma 4.3 (Machine Si divides a number by 2 and tells even from odd.) Let n > 
and m be integers, W = pi5"giriSi5™ti and k = n/2 if n is even and k = {n — l)/2 if n 
is odd. Let W2 = P2q25''r26''s25"'t2, = p^Sq^S'^r^S'^SsS^'ts. Then 

(i) Every computation of Si starting with W is semiproper. 

(a) Every word in any computation W — Ui ^ U2 ^ ■■■ has the form piS^^qiS^^riS^^SiS'^ti 
where ii + 2^2 — n, i — 1,2, 3. 

(Hi) CSi{W,qiri) consists of one (trivial) computation. 

(iv) Si{W,p2) if and only if n is even. In this case CSi{W,p2) consists of one computa- 
tion of length n/2 and rCSi{W,p2) = W2- 

(v) SiiWjPs) if and only ifn is odd. In this case CSi{W,pz) consists of one computation 
of length {n - l)/2 and tCSi{W,p^) = W3. 

Proof, (i) Indeed, let C/i — > C/2 — ^ ■■■ be a computation. Since rules 2 and 3 and their 
inverses do not insert (delete) 5, we can assume that in this computation, we apply only 
rule 1 and its inverse. Since our computation is reduced, an application of rule 1 (resp. 
its inverse) in our computation cannot be followed by an application of the inverse of rule 
1 (resp. the rule 1 itself). Therefore, in this computation we apply either only rule 1 or 
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only the inverse of rule 1. But rule 1 cannot remove a negative power of S inserted by 
previous applications of rule 1 (because rule 1 always inserts only to the left of gi). 
Similarly the inverse of rule 1 cannot remove a negative power of 6 inserted by a previous 
application of the inverse of rule 1. This proves that a negative letter which appears 
during our computation cannot disappear, so our computation is semiproper. 

(ii) can be easily proved by induction: every rule of Si preserves the property from 

(ii). 

(iii) Suppose that a reduced computation C of Si takes a word to a word W 
containing the subword qiVi. Then by (ii) W must be equal to W. Since there is 
only one rule of Si which can be applied to a word containing p2 (resp. ps) and our 
computation is reduced, we can assume that all of the words in the computation C 
contain pi. Thus all rules applied in the computation are 1 or Since the computation 
is reduced, we can assume that only rule 1 is applied (then in the inverse computation 
only rule is applied). But each application of rule 1 inserts a new 6 between qi and 
ri, so it is impossible that the last word in the computation contains the subword qiri, a 
contradiction. 

(iv) , (v). The "if" parts of the first statement in (iv) and (v) are obvious. The "only 
if" part follows from (ii). Indeed (ii) shows that if W = Ui ^ U2,--- is a computation 
of Si then Uj has the form pi6^^ qi6^^ri6^^ Si6"^ti where ii + 2^2 = n, Now if n is odd then 
ii stays always odd and no computation can take W to a. word containing the subword 
Piqi- But if we never get this subword, we can never apply rule 2, so we never get a word 
containing p2. Similarly if n is even, ii stays always even, so we can never apply rule 3 
and we can never get a word containing p^. 

Also from (ii), it follows that if C is a reduced computation of Si starting with W and 
tC contains subword ^292 then tC = W2] if tC contains subword ^3^3 then tC = W3 . 

The fact that CSi(W,p2) and CSi{W,p3) contains at most one computation immedi- 
ately follows from (iii). Indeed, suppose that, say, CSi{W,p2) contains two computations 
Ci and C2. By (iii) the end words in these computations must be the same. So we 
can consider the superposition CiC2^. This computation takes W to W, so by (iii) the 
reduced form of this computation must be trivial, so Ci = C2. □ 

The hardware of the machine ^2 is the following: 

F(2) = r (1), Q{2) = {{pi,p2}, {qu ga}, {n, r2}, {si, S2}, {h, ^2})- 
The program P{2) consists of the following rules and their inverses: 

1. [q2 6q26~^, S2 6^^826] 

2. [g2'"2S2 giriSi,p2 ^Pi,^2 ii] 



The following lemma can be proved similarly to Lemma 4.3, so we omit the proof. 



Lemma 4.4 (Machine S2 moves q and s toward each other until they meet at r.) Let 
k, i,m > 0, n be integers, W = p25''q2S^r25'^S25"'t2. Let Wi = pi(5'^+^giriSi5'""*'"ti. Then 

(i) Every reduced computation of S2 starting with W is semiproper. 
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(a) CS2{W,W) consists of one (trivial) computation. 

(Hi) S2{W,pi) if and only if k = i. In this case CS2iW,pi) consists of one computation 
and TCS2iW,pi) = Wi. 

The machine 1S3 is a cycle of machines Si and 1S2. One can describe S3 as the machine 
obtained by taking the union of Si and S3 and identifying two state vectors of Si with 
two state vectors of 1S2. This machine will divide a number (the exponent of S) until it 
gets an odd number. In case when this exponent is 0, it will never stop. Thus ^3 will tell 
non-zero numbers from zero. 

More precisely, the hardware (^(3), Q(3)) of the machine S3 is the same as the hard- 
ware of Si. The program P(3) consists of the following rules and their inverses. We divide 
the program into two subprograms which we call steps: 

Step 1. 

1.1 [qi^ 6-^qi6^,ri ^ rVi(5] 

1.2 [piQi p2q2, n r2, Si S2, ti t2] 

1.3 [pi5qi Pa^gs, ri rg, si S3, ti tg] 

Step 2. 

2.1 [q2 6q26~\s2 ^ 6-\s2S] 

2.2 [q2r2S2 qiriSi,p2 Pi,t2 ti] 

Notice that Step 1 has the same rules as the machine Si and Step 2 has the same rules 
as the machine ^2. 

Lemma 4.5 (The machine S3 tells zero from non-zero.) Let n > and either n = 
orn = 2%2m + 1). Let W = pid^'qiriSiti. Let W3 = pa^gg^'^ra^^sg^^-^m-i^^^ j.^^^ 

1. Every reduced computation C of S3 starting with the word W or W3 is semiproper. 
The length ofC does not exceed 0{\W\ + \V\) and the area does not exceed 0{{\W\ + 
\V\f) where V = rC ^ 

2. Each of the sets CS3{W, Siti) and CS3(W3,p3) consists of one (trivial) computations 

3. S3(W,p3) if and only ifn>0. In this case CS3{W,p3) consists of one computation 
and tCS3{W,P3) = W3. The length of CS3{W,p3) is 0{n) and the area is Oin^). 

Proof. In order to prove the lemma, we shall describe all possible computations 
starting with the word W . Notice that every computation C of our machine has a repre- 
sentation in one of the following forms 

Cl,lC'2,lC'l,2C'2,2--- (7) 

■^In this paper we write that f{n) = 0{g{n)) if Cig{n) < f{n) < C2g{n) for some positive constants 
Ci, C2 and every n. We write /(n) < 0{g{n)) if /(n) < Cg{n) for some constant C and every n. 
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or 

C2, 1^1,2^2, 2 • • • • (8) 

where Cjj- is a computation of step i, i = 1,2. Without loss of generahty we can assume 
that each Cij is non-triviaL The last word in each of these blocks (except the last one) 
is the first word of the next block. 

If C consists of one block then all statements of our lemma follow from Lemmas 



and |4.4 So suppose that C has more than 1 block. 

Case 1. Suppose that n > and C has the form (^. If the last word U in Ci.i contains 
Pi then the first rule applied in 6*2,1 is rule 2.2~^. This means that U contains the subword 
qiriSi. By Lemma (iii) in this case Ci.i is trivial, a contradiction. Therefore U contains 
P2. By Lemma [4.3|(iv), U has the form 



and n is even. Since the computation Ci,i is semiproper (Lemma[4.3|(i) ) and f/ is a positive 



word, this computation is proper. By Lemma [4.3| (iv) there is only one computation in 
CSi{W,p2)- It is clear that the history of this computation is (l.l)"/2(i.2) (meaning that 
first we apply rule (1.1) ^/2 times and then rule (1.2) ). 
Similarly, if a part Ci^i of C starts with a word 

Pi5''qirisi5%, 

k > 0,i > and is followed by C2,i then k is even, the computation Ci^i is proper, the 
last word in C2 i is 

P2q25''h25'/h25% 

and the history of Ci^^ is {llf^il^). 

Now suppose that a part C2,i starts with a word of the form 

U = p2q25^r25^S25% 

and there exists Ci,j+i. Let V be the last word in C2,i- 

Suppose that V contains p2- Then the first rule in Ci,j+i is rule 1.2~^. Therefore V 
contains P2q2- Thus V has the form 

P2g25'^r25'=^S25''t2. 

Since r2 and p2 do not move during the computation of step 2, ki + k2 = 2k. Since ^2 also 
does not move, we have: i' = i. Finally, it is easy to prove by induction that during a 
computation of step 2 the difference between the distance from q2 to r2 and the distance 
from r2 to S2 stays the same. Since this difference is in the word U, we have that 
ki = ^2- This implies that ki = k2 = k, i' = i, so U = V. By Lemma ^.4| (ii) C2,j is trivial, 
a contradiction. 

Thus V contains pi. By Lemma [4.4|(iii) V must have the form 



Pi6''qiriSi6''+% 
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Since this word is positive and the computation is semiproper (Lemma |4.4| (i)) the com- 
putation is proper. By Lemma [4.5| (iii) there exists only one computation which takes U 
to V. It is clear that the history of this computation is (2.1)'^(2.2). 

Let N be the number of blocks in the representation (|^) of C. Combining the infor- 
mation obtained so far, we can conclude that the history of the first N — 1 blocks has the 
form: 

(Ll)"/2(i.2)(2.1)"/2(2.2)(Ll)"/^(1.2).... 

Now suppose that the last block C in is a computation of step 1. By what we 
have proved before, C starts with the word 

where k = n/ [2'^), c = {N—l)/2 (in this case N is odd). By Lemma [4.3| (iv) if C ends with 
a word containing p2 then k is even and the history of C is (1.1)^/2(^1.2). If C ends with 
a word containing then k is odd, that is k = 2m + 1, c = e (recall that n = 2'^{2m + 1)) 
and the history of C is (1.1)^(1. 3). If this computation ends with a word containing pi 
then the computation C is semiproper and the history is (l.l)'^ for some (not necessarily 
positive) integer d. 

Thus we proved that if = rC, C has the form (|^), n > 0, and the last block in C is 
a computation of step 1, then CS^iU^V) contains only one reduced computation of the 
form (^ which is semiproper if V contains pi and is proper if V contains p2 or p^. The 
word V contains if and only if = W^. 

Suppose now that C is a computation of step 2. Then it starts with the word 

where k = n/{2'^), c = N/2 (in this case N is even). If C ends with a word containing pi 
then it's history has the form (2.1)^^(2.2). Otherwise the history of C has the form (2.1)*^ 
for some integer d. 

Let again V = tC. Then if C has the form (^, n > 0, and the last block of C is 
a computation of step 2, then V can contain pi or p2 (it cannot contain p^), CS^{U, V) 
contains only one reduced computation of the form (|^) and C is semiproper if V contains 
Pi, and it is proper if V contains p2. 

We have also proved that in this case the computation C has one of the following 
histories: 

(l.l)"/2(1.2)(2.1)"/2(2.2)(l.l)"/4(1.2)...(l.l)'=/2(i 2) (9) 
where k = n/2^ c = {N - l)/2; 

(l.l)"/2(i.2)(2.1)"/2(2.2)(l.l)"/4(1.2)...(l.l)™(1.3) (10) 

where the number of blocks is 2e + 1; 

(1.1)"/'(1.2)(2.1)"/2(2.2)(1.1)"/4(1.2)...(2.2)(1.1)'^ (11) 
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where the number of blocks is < 2e + 1; 

(l.l)"/2(i.2)(2.1)"/2(2.2)(l.l)"/4(i.2)...(1.2)(2.1)'=(2.2) (12) 
where the number of the blocks is < 2e; 

(1.1)"/2(1.2)(2.1)"/2(2.2)(1.1)"/^(1.2)...(1.2)(2.1)'^ (13) 

where the number of the blocks is < 2e. 

It is easy to check that each of these computations has length 0(|H^| + \ V\) and area 
0((|W^| + |V^|)^) where V is the last word in the computation. 

Case 2. Suppose that n > and C has the form (^. Then the first rule applied in 
C2,i is (2.2)~^ because the other command of step 2 is not applicable. 

Therefore the second word in C2,i is 

Ui = P2S'^q2r2S2t2. 

If the block 6*2,1 ends with a word containing pi then by Lemma O this word must have 
the form 

PiS'^qinsiti 

and the computation C2,i is proper. This means that rules (2.1), (2.1)"^ are not applied 
in C2,i, so the second rule applied in 6*2,1 is rule 2.2, and this computation is not reduced, 
a contradiction. 

Thus the last word in 6*2,1 must contain p2. Then the first rule applied in 6*1,2 should 
be the rule (1.2)~^. Thus the last word in 6*2,1 must contain p2?2- By Lemma [4.4| the 
computation 6* must therefore start with an application of rule (2.2)~^ and after that rule 
(2.1)"^ is applied n times (each application of rule (2.1)^^ moves q2 one step closer to P2). 
Hence the last word in 6*2,1 and the first word in 6*1,2 is 

Vi = P2g25"r25"s25-"t2. 
Now the first rule applied in 6*1,2 must be (1.2)^^. So the second word in 6*1,2 must be 

Pigi5V25"s25-%. 



By Lemma the computation 6*1,2 is semiproper, so in the rest of 6*1,2 only one rule 
applies (several times): (1.1) or (1.1)^^. If the rest of 6*1,2 consists of applications of rule 
(1.1) then the rest of 6*1,2 cannot have a word containing either piqi or giriSi. Therefore 
6* does not have block 6*2,2- 

Thus if 6*2.2 exists then 6*2 2 must start with 



pi(5^"girisir"ti 

and the history of 61.2 has the form (1.2)^^(1. 1)~". 

Continuing in this manner (induction on the length of 6*) one can prove that the 
history of computation 6* has one of the following forms: 

(2.2)-^(2.1)~"(1.2)-^(l.l)-"(2.2)~^(2.1)~^"(1.2)-^(l.l)-2«...(2.2)-^(2.1)'= (14) 
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or 

(2.2)-i(2.1)-"(1.2)-^(l.l)-"(2.2)-i(2.1)-2"(1.2)-i(l.l)-2"...(1.2)-i(l.l)'= (15) 

where k is an integer. 

This immediately imphes that the length of C is 0(|iy| + |l^|) and the area of C is 
0{{\W\ + \V\f) where V = rC. 

Also ifV = tC, n > 0, and C has the form (H) then V cannot contain and CS^^U, V) 
consists of one semiproper computation. It is clear that this V cannot be reached from 
W by any computation of the form (0). It is also clear that any word reachable by a 
computation of the form (|^) cannot be reached by a computation of the form (|^). 

Comparing our results in cases 1 and 2, we can conclude that ii V = tC, n > then 
CSsiW, V) contains only one reduced computation, this computation is semiproper if V 
contains pi or p2, and it is proper if V contains ps. Moreover there exists only one reduced 
computation in CS^^WjPs). This computation has the form (|^) and the last word in this 
computation is W3. 

Case 3. Finally consider the case when n = 0. Then W = PiqiViSiti. Then it is 



easy to see using Lemmas and that every block Cij in the computation C, except 
the last one, has only two words, so the history of each block is of length one and the 
command used in each of the blocks except for the last one is either (1.2)^^ or (2.2)±^ 
Only the indices of letters change during the computation C until the last block. Thus the 
last block starts either with piqiViSiti or with P2'?2?"2'52^2- In the first case the last block 
either has two words and the rule applied in the last block is 1.2, 2.2~^, or it contains 
more than 2 words and the rule applied in the last block is 1.1^^. In the second case, 
either the last block has 2 words and the rule is 2.2 or 1.2^^, or it contains more than two 
words and the rule applied in this block is 2.1^^. Thus the computation C has one of the 
following forms: 



for some integer k; 
for some integers k and m; 
for some integer k; 
for some integer k. 

Since k and m are 0(|V"|), in each case the length of the computation is 0(|iy | + |l^|) 
and the area is 0{{\W\ + \V\y). 

This completes the description of C. 

Thus if n = then the set of computations CS^CWjp^) is empty, and every computa- 
tion starting with W is semiproper. 

Now all statements of the lemma can be proved easily. 

(i) We have proved that every computation starting with the word W is semiproper 
and the length and area of it satisfies condition 1 of the lemma. The fact that every 
computation starting with W3 satisfies the same properties follows from Lemma EA. 



((1.2)(2.2))^ 


(16) 


((1.2)(2.2))'=(1.1)'" 


(17) 


((1.2)(2.2))^(1.2)±^ 


(18) 


((1.2)(2.2))^(1.2)±^ 


(19) 
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(ii) The first part follows from the description of all computations starting at W. The 
second part follows from Lemma |4.2| . 

(iii) This has been established above. The length of CSs{W,ps) is 0{n/2 + n/A + ...) = 
0{n) and the area is 0{n^/2 + ri^/A + ...) = 0{n^). □ 

The machine ^4 is a concatenation of two copies of with common states ps, q'3, ts, S3 
and t^. More precisely, take a copy ^3 of the machine S^. The machine S'^ is obtained 
by adding ' to all state letters of Q(3) except P3,93,r3,S3 and ^3. Thus iSg has states 
Q'(3) = {K,P2,P3} U {gl,g^,g3} U {r[,r'^,r^} U {s;,4,S3} U {^1,^2,^3}- 

Now take the union (5(4) of Q(3) and Q'{'i) and consider the union P(4) of pro- 
grams -P(3) and -P'(3) of the machines 1S3 and 1S3. The new machine with the hardware 
(F(l), Q(4)) and program P(4) will be denoted by 1S4. 

Lemma 4.6 (The machine 1S4 tells zero from non-zero and returns all state letters to 
their original positions.) Let W = pi6"'qiriSiti, n > 0, W = p[6"'q[r[s[t[. 

(i) Every reduced computation C 0/84 starting with W or W is semiproper. The length 
of C does not exceed 0{\W\ + \V\) and the area does not exceed 0{{\W\ + 
where V = rC . 

(ii) Each of the sets CSiiW^Siti), CS^iW' , s'^t'^ consists of one (trivial) computation. 

(iii) S^iW^ s'lt'i) if and only if n > 0. In this case CS^iW^ s'lt'i) consists of one computa- 
tion and rCSiiW, s'lt'^ = W . The length of this computation is 0{n). 

(iv) S^^iW' , siti) if and only if n > 0. In this case CS4^(W' , siti) consists of one com- 
putation and rCS^iW' , siti) = W . The length of this computation is 0{n) and the 
area is Oiin?). 

Proof. All statements of the lemma will again follow from a description of computa- 
tions of ^4 starting with W . 

Every computation C of ^4 can be represented in the form C1C2C3.... where Ci is a 
computation of one of the machines ^3 or neighbor blocks are computations of different 
machines, and each block is non-trivial. 

Since C starts with the word W , Ci must be a computation of the machine ^3. If 



C = Ci then C is a computation of 1S3 and we can apply Lemma [4.5| . It is clear that in 
this case tC cannot contain p'^. 

IfC^Ci then C2 is a computation of 1S3. Therefore C2 starts with a word U containing 
P3. This word is the end word of Ci. So statement SsCWjPs) holds and by Lemma O(iii) 



n > 0, U = p36q36"^r36"^ S36"'^^"^^H3 and there exists a unique reduced computation of S'^ 
which takes U to W. Since t/ is a positive word, the computation Ci is proper (Lemma 
^4.5| (i) ). By Lemma [4.5| (i) the computation C2 is semiproper. 

Suppose that C3 exists. Then it must be a computation of ^3. Therefore it must start 
with a word containing p^. By Lemma [4.5| (ii) then C2 is trivial, a contradiction. 

Thus C consists of one or two blocks: C = Ci ot C = C1C2. In both cases C is 
semiproper. The computation C can contain two blocks if and only if n > 0. 



33 



4.1 



Now all statements of the lemma follow immediately from this description and Lemmas 
andO. □ 



We shall need a copy of S4, S4 obtained from ^4 by putting over all its state letters. 

Now take an arbitrary alphabet Y and consider the following machine S5. The admis- 
sible words for ^5 have the following form: 

EuxvFE'pAiqA2rA3sA4tpA5qAQfArsAstF' 

where E, x, F, E', p, q, r, s, t, p, q, f, s, t, F' are state letters from different Q-components, 
M G (y U Y""^)*, Aj G {6). If z is one of the letters p,q,r, s,t,p,q,f,s,i then z belongs 
to a Q-component {zq, zi, Z2, Z3, Z4, Zq, z[, Z2}. The alphabet of tape letters also includes 
letter X4 from the same component as X. 

The program of S5 consists of the following rules (one for each a G F) and their 
inverses: 

R{a) [x a-'^xa, p[ Pq, q[r[s[t[po go^oSo^o ^ qiriSih] 

The machine has the same hardware as 1S5 and just one rule: 
bo -^Pi, q'or'os'ot'oP'i qinsitipo, q[f[s[i[ go^oSo^o] 

The machine 1S7 has the same hardware as 1S5 and Sq and one rule: 

[Ex EX4, PiqiTiSitipo PAqiTiSitApi, go^oSoto qifASAti] 

It is clear that every nontrivial reduced computation of iSg or S^ has length 2. 
The computations of 1S5 are more complicated but also easy to describe. 
The proof of the following lemma is similar to the proofs of the previous lemmas and 
is left to the reader. 

We shall need one more definition. Let (Y) be the free group with free basis Y and 
let (1) be the additive group of integers. Consider the homomorphism from (Y) to (1) 
which takes every element from F to 1. Then the image of a word w from (Y) under this 
homomorphism will be denoted by \ \w\\. Thus ||w|| is the algebraic sum of the degrees of 
the letters in w. For example ||a6~^c|| = 1. 

Lemma 4.7 The following statements hold. 

(i) Any computation of S5 has a history of the form 

R{ai)R{a2y^Ria3)... 

or 

R{ai)-^R{a2)R{a3)-\.. 

where Oi eY . If z = 0102^... or z = a{^a2..., ai G Y , then the computation starting 
with a word W and having the history corresponding to z will he denoted by C (z, W) . 

(a) Every computation of is semiproper. 

(Hi) Suppose that W = EuxvFE'p6^qrstp6"^qfstF' where u,v E (F U Y~^)* and either 
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(1) (P, r, s, t) = (p'l, q[, r[, t'^) and (p, g, r, s, f) = (po, ?o, ro, sq, to) 
or 

(0) {p, q, r, s, t) = (po, q'o, rd, Sq, ^q) and {p, q, f, s, f) = (pi, ^i, n, Si, h)- 
Let C — C{z, W) be a computation of and W — tC . Then 
W = Euz-^xzvFE'pS''-'qrstpS"'+'qfsiF' 

and 

(0,0) ifW contains p'q and W contains p'^ then z = aka^^iak-2---<ii^ for some even 
number k, ai & Y , i — 1, k, e — 0. Such z will be called words of type (0,0). 
Notice that \ \z\ \ =0 

(1.0) ifW contains p[ andW contains p'^ then z — aka'^\...ai for some odd number 
k, ai E Y , i ~ 1, ...,k, e—1. Such z will be called words of type (1,0). Notice 
that \ \z\ \ — 1. 

(0,1) if W contains p'^ and W contains p'l then z = a^'^ak-i.-.ai^ for some odd 
number k, E Y , i = l,...,k, e = —1. Such z will be called words of type 
(0,1). Notice that \ \z\ \ = -1. 

(1.1) if W contains p[ and W contains p[ then z = a^^ak-i-.-ai for some even 
number k, eY , i = 1, k, e — Q. Such z will be called words of type (1,1). 
Notice that \ \z\ \ — 0. 

Notice that in each of the four cases if z is a word of the type {i,j) then \\z\\ = 
i-j^e. 

Wc shall need three simple facts about words of type G {0, 1} introduced 

in the previous lemma. 

Lemma 4.8 Let Zj, j — 1, k, be an irreducible non-empty word of type where 
e {0, 1} such that£j+i — l — ^'j for every j — l,...,k — l. Then the product ZkZk-i-.-Zi 
( in the free semigroup ) is an irreducible word. 

Proof. Indeed, an easy inspection of the definition of the words of type G 
{0, 1}, gives that if Zi is of type {i, 0) and Z2 is of type (1, then Z2 ends with a positive 
letter and Zi starts with a positive letter. If, on the other hand, Zi is of type and 
Z2 is of the type (0, i') then Z2 ends with a negative letter and Zi starts with a negative 
letter. In both cases there are no cancellations in the product -22-21. This immediately 
implies the statement of the lemma. □ 

Lemma 4.9 Let v be a positive word which does not end with letter a E Y . Then 
va~^ cannot be represented in the form ZkZk-i...Zi where Zj is a word of type {£j,£'j), 
ij, i'j e {0, 1} such that £j+i = 1 — i'j for every j = 1, k — 1, and z\ is of type (1, 0). 
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Proof. Let v = 0102. ..cim where ai G Y, i = l,...,m and am 7^ Suppose that 
va~^ = ZkZk-i-.-Zi where the zj satisfy the conditions of our lemma. Since zi must be of 
type (1,0), it cannot end with a negative letter. Therefore the product ZkZk-i---Zi must 
have cancellations. But this contradicts Lemma |4.8|. □ 



Lemma 4.10 Let v = amO'm-i---0'i be a word from with aj G Y . Suppose that 
V = ZkZk-i-.-Zi where Zj is a word of type G {0, 1} such that ij^i = 1 — 

for every j = 1, k — 1. Then k = m and Zj = aj for every j = 1, k. 



Proof. Indeed, since the product z^z^-i-.-Zi is reduced by Lemma |^^ , Zj contains no 
negative letters. But it immediately follows from the definition of words of type {i, £') 
that if Zj is not a one letter word, it contains a negative letter. Thus each zj is a one 
letter word: m = k and Zj = aj for every j = 1, m. □ 

The machine Ss is a cycle of the machines 1S4, 1S5, 1S4, Sq and 1S7. 
More precisely, Ss has the same hardware as 1S5, Sq and 1S7 and its program is the 
union of the programs of 1S4, 1S5, 1S4, Sq and 1S7. 

An informal description of Sg is the following. Suppose that it starts with a word 

W = EuxFE'pi5''qiriSitipoqofoSoioF'. 

where n > 0. In general the work of the machine can be complicated, so we describe the 
"ideal" computation assuming that u is a positive word and n = \u\. The next lemma will 
show that this is the only computation from CSsiWjP^). First Sg checks whether n > 0. 
If ri = then by our assumption u = and iSg uses rules of Sj and finishes with the word 

W4 = ExiFE'piq4r4^S4tip4_q4riS^iF' . 

If n > then uses rules of to move x one letter toward E, replaces n by — 1 and 
inserts 5 between p and q. Then uses rules of ^4 to check that the word between p 
and q is not empty (which is true), then it uses 1S4 again to check if n — 1 is not zero. If 
n — 1 = then \u\ = 1, and Ss uses 1S7 and gets the word 

W4 = Ex4UFE'p4q4r4S4^t4p4^6q4f4S4t4^F'. 

If n — 1 > then Ss uses 1S5 again and moves x one letter closer to E, replaces n — 1 by 
n — 2 and inserts another S between p and q. This process repeats until x reaches E (and 
all n letters S move to the right of p). After that Ss uses Sj and we obtain the word 

W4 = Ex4uFE'p4q4r4S4t4p46^q4f4S4i4F'. 

One can see that this is certainly not the only computation which can start with W. 
For example, at the beginning Sg can apply Sq (backward) instead of S4. Also after S5, 
the machine can proceed with using S4 instead of ^4 (if the computation of 1S5 is of the 
type (1,1) ), etc. A description of all computations of Ss starting with W is contained in 
the following lemma. 
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Lemma 4.11 (Ss tells positive words in the alphabet Y from almost positive words.) 
Let 

W = EuxFE'piS^'qiriSitipoqofoSotoF', 
W4 — Ex4uFE'piq4rASit4Pi8^q4rASit4^F\ 

n > 0. Then 

(i) Every computation C of Sg starting with W or W4 is semiproper. The length of C 
does not exceed 0{{\W\ + \V\y) and the area does not exceed 0{{\W\ + \V\)^) where 
V is the last word in the computation C . 

(a) The set CSg{W,xF, siti) consists of one (trivial) computation. 

(Hi) If u is a positive word then the statement S^iW^p/^) is true if and only if n = \u\. 
In this case CSs{W,p4) consists of one computation and tCSs{W,P4) = Wa,. The 
length of this computation is Oin^) and the area is 0{n^). 

(iv) If u has the form u'a^^ where a eY and u' is a positive words, and u' does not end 
with a, then for every n the statement Ss{W,p4) is false. 

Proof. As before, we shall describe all computations of our machine which begin with 

W. 

Let C be a reduced computation of our machine starting with W. It is clear that C 
has the form 

CiC2...Civ. (20) 

where Cj is a non-trivial computation of one of the machines 54, S5, 84^, Sq and Sr- 
Consecutive blocks should correspond to different machines. 

It is easy to see that only the last block can be a computation of S^. 

Let Ui be the first word in the block Cj, i = 1, ...,N. Then there should be rules of 
at least two machines among S4,S5,S4,Sq, Sy which are applicable to each of these words 
Ui. Easy inspection (checking all 10 possible pairs of machines) shows that Ui must have 
the form 

EviXWiFE'pS"'' qrstp5"^' qfstF' 

where Vi.Wi E {Y U Y~^)*, ni,mi are integers, and the word pqrstpqfst, which will be 
denoted by x{Ui), is equal to one of the following words: 

Pi = PiqiriSitipoqoroSoto- 
P2 = pWirWit'iPoQoroSoio- 
P3 = PWofWoPiQifiSiti- 
Pi = PoQoroSotoPiQ'As[t[. 
P5 = PiqAriSitiPiq^fiSiU. 
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We are going to prove the following statements by induction on i = 1, ...,N. All of 
them are trivial for i = 1. 
Fact 1. rii > 0. 
Fact 2. m, > 0. 
Fact 3. rii + rrii = n. 

Fact 4. Wi = ZkZk-i---Zi, Vi = uw~^ where each word Zj (j = 1, k) is of type 
and these types satisfy the following property: ij^i = 1 — i'j, j = 1, ...,k — l. The number 
k = k{i) is the number of blocks Cj, j < i, which are computations of S^. If is a 
computation of then the words zi, Zk are of type (1, 0). 

Suppose that all these fact have been proven for all numbers < i. 

Proof of Fact 1. If Cj is a computation of one of the machines 1S4, 1S4, Sq then 
nj+i = rii (for 1S4 and 1S4 this follows from Lemma for it is obvious), so n^+i > 
as desired. Thus the only non-trivial case is when Cj is a computation of 1S5. It is easy to 
see that in this case x{Ui) = P2 or x{Ui) = P3. 

Suppose first that x{Ui) = P2- There are only two machines from the list {1S4, 1S5, 1S4, 
Sq} which have rules applicable to words with x = ^2- These are and S4. Therefore 
Cj_i is a computation of S4. By Lemma then x(f^i-i) = Pi ^i-i = ni > 0. By 
Lemma |4.7| , rij+i = rij — e where e G {0, 1}. Therefore Uj+i > 0. 



Suppose now that x{Ui) = -Ps- By Lemma then n^+i = rij — e where e G {0, —1}. 
Since > by assumption, nj+i > 0. This proves Fact 1. // 

Proof of Fact 2. As in the proof of Fact 1, we can assume that Ci is a computation 
of S,. Again = P2 or x(f/.) = ^3- 

Suppose that x{Ui) = P2- Then by Lemma mj+i = mj + e where e G {1,0}, so 
rrii+i > 0. 

Now suppose that x{Ui) = Ps- There are only two machines in our list which are 
applicable to words with x = Ps- These are 1S4 and 1S5. Therefore Cj_i is a computation 



of S4. Then by Lemma |4.6| , x(f^i-i) = Pi aiid mi_i = mj > 0. Since by Lemma ^77 , 
mj+i = mj + e where e G {0, —1}, we can conclude that mj+i > 0. This proves Fact 2. 

Fact 3 is obvious because during the computation of Ss the total number of occur- 
rences of 6 stays the same. 

Proof of Fact 4. Machines ^4, 54, Sq do not change Wj and Wi. By Lemma [4.7| , 
replaces x by for some word z ^ {YU Y^-^)* of type (£, £') where i, i' G {0, 1}. This 

shows that for every i, Wi = z^Zk-i-.-Zi, zj is of type {£j,i'j), and u = ViWi, where k = k{i) 
is the number of computations of the machine 1S5 among the blocks Ci, ...,Cj_i. Let us 
prove that = 1 — £j for j = 1, k{i) — 1. 

Again we assume that this statement is proved for all numbers < i. Since only 1S5 can 
change Vi or Wi, we can assume that Cj is a computation of S5. 

Let Wj = be the above representation of Wi. 

Suppose that x{Ui) = P2. Then by Lemma ^77\, t>j+i = ViZ~^, Wj+i = zwi where z is of 
type (1, h) for some h G {0, 1}. We need to prove that i'j. = 0, that is Zk is of type {ik, 0). 
Without loss of generality we can assume that k{i) > 0. 

As we mentioned in the proof of Fact 2, in this case Cj_i is a computation of ^4. Since 
fii > (by Fact 1), we can apply Lemma and conclude that x{Ui-i) = Pi and rij > 0. 
Since k{i) > 0, i — 1 > 1. There are only two machines applicable to Ui-i. These are 1S4 
and Sq. Therefore Cj_2 is a computation of S^. Therefore x(f^i-2) = Pa- Since P4 7^ Pi, 
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i — 2 > 1. The block Cis must be a computation of S4 because 1S4 and Sq are the only 
machines in our list which have rules applicable to words with x = Pi- Since > 0, 
we can apply Lemma TJ: and conclude that x{Ui~z) = Ps- This implies that z — 3 > 1 



and Ci_4 is a computation of S5. Since Cj_4 is the last computation of S5 among the 
blocks Cj, j < i, by our assumption, Ci_4 replaced x with zj^'^xzk where k = k{i). Since 
f/j_3 = rCj_4 and x(^i-3) = A; by Lemma ^?7| , has type (^,0) for some i. Therefore 



= as desired. 



Now suppose that x{Ui) = Ps- In this case by Lemma O, Cj replaces x hj z ^xz 



where z has type (0,£') for some i' G {0, 1}. We need to prove that £'j^ = 1, that is Zk is 
of type (4, !)• 

Arguing in the same way as before, we conclude that is a computation of 1S4, 
x{Ui-i) = P4,; Cj_2 is a computation of Sq, xiUi-2) = -Pi; Ci^^ is a computation of iSi, 
x(f^j-3) = -P2; Cj_4 is a computation of 1S5. By our assumptions Cj_4 replaces x by z^^xz^. 
Since f/j_3 = rCj_4 and ^(t/i-s), by Lemma the type of is (4, 1) for some 4, as 
desired. 

Now let us prove that if is a computation of S^ then zi is of type (1,0). We 
have established that if Ci is the first block in pO| ) which is a computation of then Cj 
replaces a; with Zi^xzi. We have also proved that x{Ui) = P2 or x(t/j) = P3, so i > 1. 

If = P2 then Cj_i is a computation of 1S4, x(f^i-i) = Pi- If « — 1 > 1 then Cj_2 

is a computation of iSe, x{Ui-2) = P4 so i — 2 > 1. Then Cj_3 is a computation of 1S4, so 
Ui-3 = P3 and 2 — 3 > 1. Then Ui-4 must be a computation of 1S5. This contradicts the 
fact that Cj is the first block in (^) which is a computation of 1S5. Thus 2 — 1 = 1 that 
is 2 = 2. Therefore Cj_i = Ci is a computation of 1S4. Therefore by Lemma |4.6|, 



U2 = EuxFE'p[6''q[r[s[t[poqofoSotoF'. 

and n > 0. 

Suppose that ^(t/s) = -P2- Then by Lemma zi has type (1,1) and 

f/3 = Euz^'xz^FE'p[S''q[r[s[t[PoqofoSotoF'. 

Therefore C3 is a computation of 1S4 (it cannot be a computation of 1S7), so > 3. This 
implies that 

f/4 = Euz^^xziFE'pi6^qiriSitipoqofQSoiQF'. 

Since n > 0, C4 cannot be a computation of Sj. So > 4 and C4 is a computation of Sq 
and 

U5 = Euz^^xziFE'pQ6'^qyQSQtQp[q[f[s[i[F'. 
Hence C5 should be a computation of ^4 and Uq should be equal to 

Euz^^xziFE'p'Q6''qyQs'Qt'QPiqifiSitiF'. 



But this is impossible by Lemma ^]6| since mg = 0. This means that xi^s) = P3, so by 
Lemma [4.7| Zi has type (1,0) as desired. 

Now suppose that x{Ui) = P3. Then Cj_i is a computation of 1S4. Then x(?7j_i) = P4 
and rrij = rrii-i > (by Lemma |4^ ). Then i — 1 > 1 and Ci_2 is a computation of (Se. 
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This implies that x(f/j-2) = Pi, = f^i-i > 0. Since mj_2 > 0, i — 2 cannot be equal 
to 1 (since [/j_2 Ui = W). Therefore Cj_3 exists and is a computation of 1S4. Then 
x{Ui-3) = P2 and z — 3 > 1. Therefore Cj_4 exists and is a computation of S5. This 
contradicts the assumption that Ci is the first block in (^) which is a computation of S5. 

It remains to prove that if Cn is a computation of iSy then Zk has type (1,0). We 
know that if Cj is the last block in ( pO]) which is a computation of then Cj replaces x 
by z'l^^xzk- As before, x(f^j) is either P2 or P3. 

Suppose first that x{Ui) = P2. Then Cj_i is a computation of S4. This implies (by 
Lemma ^4.6|) that = rii^i > 0. 

Suppose that x(^i+i) = -^2- Then Zk is of type (1,1) and so n^+i = > 0. Further- 
more, Cj+i is a computation of 1S4, so x(f/j+2) = -Pi and nj+2 = n^+i > 0. Since nj+2 > 0, 
Cj_|_3 cannot be a computation of Sj. Therefore it must be a computation of Sq. Then 
Cj+4 must be a computation of 1S4 and Cj+5 must be a computation of ^5 (both i + 4 and 
2 + 5 are of course smaller than A^, since Cat is a computation of 1S7). But this contra- 
dicts the fact that is the last block in (pOf ) which is a computation of S^. Therefore 
x{Ui+i) = P3, so by Lemma ^77[ Zk has type (1, 0) as desired. 

Now suppose that x{Ui) = P3. Then again x(f/j+i) = P2 or x(?7j+i) = P3. We shall 
show that both possibilities lead to contradictions. 

Suppose first that x{Ui+i) = P2. Then is a computation of 1S4, x(f^j+2) = -Pi 
and nj+2 > (Lemma W^ )- Therefore Cj+3 is not a computation of 1S7. Hence Ci+3 is 
a computation of Sq, Ci^i is a computation of 1S4 and Ci+5 is a computation of 1S5, a 
contradiction. 

Finally let x(^i+i) = Ps- Then by Lemma the type of 2;^ is (0, 0). Since is a 



computation of 1S4, > by Lemma [4.6| . Since the type of z^ is (0,0) then by Lemma 
rij+i = nj > 0. Then Q+i, Cj+2 are computations of, respectively, 1S4 and Sq. By Lemma 
ni+2 = = ''^i+i > and xiPi+z) = Pi- Since nj+3 > 0, Cj+4 is not a computation 



of ^7, so it is a computation of ^4. Then the next block, Ci+5, is a computation of S5, a 
contradiction. 

This completes the proof of Fact 4. 

Now let us complete the proof of the lemma. 

Let C be a computation starting with W which has the form ([2D|). Let again Ui be 
the first word in Cj. By Facts 1-4 that we have proved, 

Ui = EviXWiFE'pd'^'qrstpd'^'qfstF' 

where Vi, Wi G {Y UY^^)* , rij, rrii are integers, the word x{Ui) = pqrstpqfsi is equal to one 
of the words Pi, P2, P3, P4, P5, Wi = ZkZk-i-.-Zi where each Zj is of type for some 

Vi = uwl^, Ui > 0, mj > 0. 



Then by Lemmas and [4.7| all blocks Ci are semiproper. Moreover, all Cj, i < A^ — 1, 
which are computations of the machines ^4, ^4, Sq and ^7 are proper: no negative letters 
can be inserted during these computations because in the initial and final words in these 
computations, the parts of the words which can be modified by the rules of these machines 
are positive (since > 0,mi > 0). All negative letters inserted during computations of 



between E and F belong to the subwords Zj. But by Lemma the word ZkZk-i---Zi is 
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reduced, so no negative letter inserted between E and F during one of the computations 
of 1S5 can be cancelled during another computation of 1S5. We also already know that 
the blocks Ci corresponding to S5 did not insert negative letters (5's) between E' and F' 
because these blocks are semiproper (Lemma [4.7|) and ra, and are always non-negative. 

This proves the first part of (i): every computation of Ss starting with W is semiproper. 
Let A^^i be the number of computations of 1S5 among blocks in (pOl). Then our analysis 
shows that < 4Ni and A^^i is equal to the number of subwords Zi. Thus A^^i < \tC\. If Ci 
is a computation of then the length of Ci is bounded by the length of the corresponding 
word Zi. Therefore it is bounded by the length of \tC\. The area of this block is bounded 
by |rCp. Thus the sum of lengths of these blocks is bounded by [rCp and the total 
area of these blocks is bounded by ItC^. By Lemma the length of each of the other 
blocks is bounded by 0(|rC| + (notice that the lengths of Ui do not decrease when 
i goes from 1 to A^). And again the area of each of the other blocks does not exceed 
0{{\TC\ + \W\y). This implies that the length of C does not exceed 0{{\tC\ + \ W\y) 
and the area does not exceed 0((|rC| + This proves the second part of (i). 

Let C be a computation in CSs{W,xF,siti). Let W = tC. By Fact 4 and Lemma 
|4.8| there are no blocks in the representation (|20|) of C which are computations of 
(otherwise the subword in W between x and F would not be empty). 

This implies that either C has just one block which is a computation of 1S4 or Sq, or 
it has two blocks C1C2 where Ci is a computation of Sq and C2 is a computation of 1S4. 

The latter case is impossible because then W would not have the subword siti {Sq 
changes si to Sq and 1S4 does not touch state letters without a """). 

Any nontrivial reduced computation of Sq consists of two words, and there are no 
nontrivial computations in CSq{W, Siti). Therefore C cannot be a computation of Sq. 

Finally by Lemma every computation of CS4^{W, Siti) is trivial. Thus C is not a 
computation of 1S4. All cases have been considered, so CS8(W,xF,siti) consists of one 
(trivial) computation. This proves (ii). 

Suppose that u is a positive word and n = \u\. Then there exists a computation of Sg 
which starts with W and ends with M/4. This computation has the following representation 
as a sequence of blocks: 



^^4,l(^5,1^^4,1^^6,lC'4,2---C'4,nC'5,nC'4,nC'6,nC'7,l (21) 

where Cij is a computation of Si {i = 4,5,6,7, j = l,...,n) and C^j is a computation 
of S4. Every computation of Sr, in this sequence consists of two words, so all Zi are one 
letter words. 

Now suppose that there exists a computation C in CSs{W,p4). Let W = TCSsiW,p4). 
By Facts 3 and 4, W = W^. By Fact 4, u = ZkZk-i...zi where the ZiS satisfy the properties 
listed in Fact 4. By Lemma [4.101 , since m is a positive word, every Zi is a one letter word. 
Therefore by Fact 4, every block in ( pO|) which is a computation of iSs consists of two 
words. Our description of computations of Ss starting with W (see proofs of Facts 1-4) 
show that the computation must have the form 

4,l<-^5,l'-^4,l<-^6,ltv4,2---'-^4,A;tv5,fctv4,fc'-'6,fc'-'7,l 

where k = \u\. Each block Csj consists of an application of a rule of the form R[a) which 
subtracts 1 from n, the number of 5's between p and t (here we omit indices in p and t\ 
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Other blocks do not touch this number. Since there is no S between p and t in the last 
word in Cg^fc (otherwise we would not be able to apply the rule of Sj to this word), k must 
be equal to n. 

Thus we proved (iii): if u is positive then CSs{W,p4) is not empty if and only if 
n = \u\; in this case CSsiW,p4) consists of one computation The length of this 

computation can be easily computed with the help of Lemma [4.6| (iii). It is equal to 
0{n+ {n-l) + + = 0{n^). 



Statement (iv) follows from Fact 4 and Lemma ^]9[ Indeed, by Fact 4, if CSs{W,p4) 
is not empty then u = ZkZk-i.-.zi where the 2;j's satisfy the properties listed in Fact 4. By 
Lemma ^]9| such a representation of u is impossible iiu = u'a~^ where a G F, and u' is a 
positive word which does not end with a. 

This completes the proof of the Lemma. □. 

The machine iSg is obtained from in the same way ^4 was obtained from ^3. Let 
5g be a copy of which is obtained by adding ' to all state letters except E, F, E', F' 
and those state letters which have index 4. Let iSg be the concatenation of iSg and iSg. 
Namely, the tape alphabet vector F(9) of is the same as the tape alphabet of iSg, the 
state alphabet vector (5(9) is the union of state alphabet vectors of and iSg, and the 
program of iSg is the union of the programs of and SL 



The following Lemma is similar to Lemma ^ 



Lemma 4.12 (iSg tells positive words in the alphabet Y from almost positive words and 
returns the state letters to their original positions.) Let 

W = EuxFE'pi6'^qiriSitipoqofoSotoF' 

where u is either a positive word or a reduced word of the form u'a~^ where u' is a positive 
word and a is a letter. Let 

W = EuxFE'pi6^qifiSiiiPiqQfQSoioF' 

n > 0. Then 

(i) Every computation C of Sq starting with W or W is semiproper. The length of C 
does not exceed 0{{\W\ + \V\Y) and the area does not exceed 0{{\W\ + \V\Y) where 
V is the last word in the computation. 

(a) Each of the sets CSg{W,xE, siti) and CSg{W,xE,siii) consists of one (trivial) 
computation. 

(iii) The statement Sg{W,x) (statement Sg{W,x)) is true if and only if u is a posi- 
tive word and n = \u\. In this case CS9{W,x) (resp. CSg(W,x)) consists of one 
computation and TCSg{W,x) = W (resp. TCSg{W,x) = W). The length of this 
computation is 0(n?). 

(iv) If u has the form u'a~^ where a and u' is a positive word, and u' does not 
end with a, then for every n the statement Sg(W, x) and the statement SgiW ^ x) are 
false. 
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Proof. Every computation C of iSg starting with W can be represented in the form 
C1C2C3.... where Ci is a computation of one of the machines iSg or iSg, neighboring blocks 
are computations of different machines, and each block is non-trivial. 

Since C starts with the word W , Ci must be a computation of the machine Sg- If 



C = Ci then C is a computation of Ss and we can apply Lemma [4.11| . It is clear that in 
this case tC cannot contain x. 

If C Ci then C2 is a computation of iSg. Therefore C2 starts with a word U 
containing p^. This word is the end word of Ci. So statement Ss(W,p4) holds and by 



Lemma ^4.11| (iii), (iv) u is positive and n = \u\. Now by Lemma |4.11| (i) and Lemma ^]2| 



the computation Ci is proper (it is semiproper and the end word is positive) and C2 is 
semiproper. 

Suppose that C3 exists. Then it must be a computation of iSg. Therefore it must start 
with a word containing P4. By Lemma 4.11| (ii) then C2 is trivial, a contradiction. 



Thus C consists of one or two blocks: C = Ci or C = C1C2. In both cases C is 
semiproper. The computation C can contain two blocks if and only if u is positive and 
n = \u\. 

Now all statements of the lemma follow immediately from this description and Lemmas 



Ol|, gjandgj. □ 

The admissible words of the ^'-machine Sa have the following form: 

where x is one of the letters x, Xi,X2- The rules of Sa are the following: 
(al) X a^^xa, 

(02) Ex — > Exi, 

(03) xi — > axia~^, 

(04) XiF X2F. 

Lemma 4.13 (Sa moves x from F to E and hack). Let W = Ea^xF , W2 = Ea"'X2F, 
n > 0. Then: 

1. Every reduced computation C starting with W or W2 is semiproper. The length of 
C does not exceed 0{\W\ + \V\) and the area does not exceed 0{{\W\ + |V^|)^) where 
V = tC. 

2. Each of the sets CSa{W,xF) and CSa{W2.,X2) consists of one (trivial) computation. 

3. CSa{W,X2) consists of exactly one computation with the history word 

(«l)"(«2)(«3)"(«4) 

and TCSa{W.,X2) = W2, and Exi is a subword of a word in this computation. 
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Proof. Indeed it is easy to see that the history h of every computation C starting 
with is a prefix of the following word 



for some numbers k and i. This and Lemma immediately imply statements 1 and 2 
of the lemma. 

If the history h contains (a2) then k = n because every application of (al) moves x 
one letter closer to E. Similarly if h contains (q;4) then it must contain (q;2) and i must 
be equal to n. This implies statement 3. □ 

The machine S^j is similar to Sa- 

The admissible words of the S'-machine S^^ have the following form: 
where x' is one of the letters x',x'i,X2. The rules of S^^ are the following: 

(ujI) x' — > ijJXUJ^^ ^ 

(cja) x'F'^x[F', 
(cjs) x'l uj^^x[uj, 
{UJ4) E'x[ E'x'2. 



The following lemma is similar to Lemma 4.13, so its proof is omitted. 



Lemma 4.14 (5^ moves x' from E' to F' and hack). Let W = E'xcu^F', W2 = 
E'xiUj'^F', n > 0. Then: 

1. Every reduced computation C starting with W or W2 is semiproper. The length of 
C does not exceed 0{\W\ + \V\) and the area does not exceed 0{{\W\ + \V\Y where 
V = tC. 

2. Each of the sets CS^^iW, E'x') and CSa{W2,X2) consists of one (trivial) computa- 
tion. 

3. CSu){W.,x'2) consists of exactly one computation with the history word 

K)"(^2)(^3)"(^4) 

and tCSuj{W, x'2) = W2, and x'^F' is a subword of a word in this computation. 
Now we are ready to take any Turing machine 

M = (x,F,g,e,si,so) 

satisfying the conditions of Lemma and to construct an S'-machine S{M) simulating 
M. 
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For every q G Q we denote the word quo by Fq. Let us replace qui in every command of 
M hj Fq . After that we won't have g's in any command of M because of part 5 of Lemma 
O. In order to have notation similar to that of the machine Sg, we also denote the left 



marker on tape by Ei. This gives us a Turing machine M' such that the configurations 
of each tape of M' have the form EiuFq where m is a word in the alphabet of tape and 
every command or its inverse has one of the forms: 

{Fq, ^ Fq,,...,aFq^ ^ Fq,.-.Fq, F,, } (22) 

where a eY oi 

{Fq,^Fq,,...,E,Fq^^E,Fq,^,...,Fq^^Fq,J (23) 

This machine recognizes the same language and has the same complexity functions as M, 
so we can assume that M itself has this form. 

An admissible word of the 5- machine S{M) is a product of three parts. The first part 
has the form 

E(0)a"ix(0)a"2F(0) 
The second part is a product of k words of the form 

E(z)M,x(2)t;,F(i)E'(2)p(i)A,,ig(j)A,,2r(2)Ai,3s(i)A,,4t(i)Ai,5 
p{i)Ai^Qq{i)Aijf{i)Ai^ss{i)^i,9iii)^i,ioF'{i) 

i = 1, k. The third part has the form 

E'{k + l)u<x'{k + l)u'''^F'{k + 1) 

Here Ui, Vi are group words in the alphabet Yi of tape i, and Aij is a power of 6. The 
letters 

E{i),x{i^, F{t), E'{7),p{t), q{t), r{t), s{i),t{i),p{t), g(z), f(z), s(z), t(z), F'{i) 
belong to disjoint sets of state letters which we shall denote, respectively, by 

E(z), X(^), F(^), E'(^), P(z), Q{i), R(0, S(z), P(^), Q(z), R(z), S(^), T(^), F'(z), 

z = 0, k + 1. The description of these sets is below. 

Since M is symmetric, for every command from G its inverse is in G. Let us call 
commands of the form (^) positive, the inverses of these commands will be called negative. 
We also choose one from each pair of mutually inverse commands of the form ( |23| ) and 
call it positive; the other command in this pair will be called negative. As we have seen 
before, every command in G is either positive or negative. 

Now let us describe the set of state letters of S{M). First for every i = 0, ...,k + 1 
we include the letter E{i) into E(i) and the letter E'{i) into E'(i). Then for every state 
letter F on tape i we include F into F{i) and F' into F'{i). We also include the letters 

x{i),p{i), q{i),r{i), s(i), t(i),p(i), q{i),f{{}, s{i),t{i) 
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into the corresponding sets X{i), P{i), Q(i), R(i), S{i), T(i), P{i), Q(i), R(i), S{i), T{i), 
{i = 1, ...,k). The letters that we just described will be called standard. 

Now let r be a positive command in G. Then for every 7 G {4, 9, a, u} and for each 
component U(z) of the vector of sets of state letters, we include the letter ?7(?,r, 7) into 



Suppose that r has the form (p2D: 

r = {F,, ^ F,, , aF,^ ^ F,., F,, ^ F,. }. 

for some i from 1 to k. For each S-machine 5^ where 7 G {4, 9, a, 00} we consider a copy 
S^{t) where every state letter z is replaced by z{j, r, 7) where j = z if 7 = 4, 9, j = if 
7 = a and j = + 1 if 7 = cu. We include these state letters into the corresponding sets 
P(0, Q(2), R(2), S(2), T(2), P(^), Q(2), R(2), S(z), T(z) if 7 e {4,9}, in E(0), X(0), F(0) 
if 7 = a and into E{k + 1), X(fc + 1), F{k + 1) if 7 = cj. 

The set of state letters that we just described is the set of all state letters of S{M). 

Now let us describe the set of rules of S{M). It will consist of the rules of iS4(r), Sq{t), 
Sair), Suj{t) for all r of the form ( p2D plus the following connecting rules. First let r be 



a command of the form (^21) : 



r = {Fg^ ^ Fg^, aFq^ F^/, F^^ F^J. 

Notice that r determines the letter a and the index i. 

The machine S{M) simulates this command as follows. First, using S^It), it checks 
whether the word between E'{i) and F'g_{i) is not empty. If it is empty, the execution 
cannot proceed to the next step. Otherwise the machine changes qj to q'^ in the indices 
of the F's, inserts next to the left of x{i), removes one 5 in the word between E'{i) 
and F^_{i), removes one a and removes one u. Then it uses Sa^r) and iS^(t) to move 
x{0) from F(0) to F(0) and back and to move x{k + 1) from E'{k + 1) to F'{k + 1) and 
back (the purpose of these seemingly needless moves will be clear later). Finally it checks 
(using Sq{t)) if after we inserted the word between E{i) and Fg^(i) is positive. If 
it is positive, the machine gets ready to execute the next transition of (by returning 
all state letters to their standard forms). Otherwise, the machine cannot proceed to the 
execution of the next transition from B. 

Here is the description of the connecting rules in the case when r has the form 



RaIt). This rule is applicable to an admissible word W if all state letters of W are standard, 
and W contains the subwords 



xij)F,^ij)E'ij)p{j) 

and 

q{j)rij)s{j)t{j)F:^^{j) 

for every 1 < j < k, and also the subwords x(0)F(0) and E'{k + l)x'{k + 1). It 
changes each standard state letter z{j) to z{j,T,A). The meaning of this rule is 
simple: it turns on the machine S^It). 
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R'i,a{T)- This rule is applicable when all state letters in an admissible word W have (r, 4) 
in their labels, and in addition W contains the subword s'i(i,r,4:)t'i(i,r,4:) (that is 
when 54 (r) finishes its work). It 



• changes qj to q'j in the indices oi F (j — 1, ...,k), 



• replaces 
by 

• replaces 
by 

• replaces 
by 

• replaces 

by 



x(0,T,4)F(0,T,4) 
a~^x{0, T, a)F{0, r, a), 
E'(k + 1,T, A)x(k + l,r, 4) 
E'{k + 1, T, a)x{k + 1, T, a)uj~^, 

x{i, T, 4)F,,(i, T, 4)£;'(i, T, A)p[{i, T, 4) 
a~^x(i, T, a)Fq',{i, r, a)E'{i, r, Q;)pi(i, r, q;)(5~^, 
4)r'i(i, T, 4)s;(i, T, 4)t;(i, T, 4) 



qi(i, r, a)ri(i, r, a)si(i, r, a)ti(i, r, a) 

• replaces (r, 4) by (r, a) in all other state letters. 

This rule simulates execution of the transition r and turns on the machine Sair). 

Ra,u} (t) ■ This rule is applicable when all state letters in an admissible word have (r, a) in 
their labels and 0:2(0, r, a) occurs in this word. It changes (r, a) to (r, a;) in the 
labels of all state letters, and replaces 0:2(0, r, a) by x{0,t,u!). This rule turns on 
the machine ^^^(r). 

Ru),9{t) This rule is applicable when all state letters in an admissible word have {t,uj) in 
their labels, and the letter X2{k + l,r, cj) occurs in this word. It changes (t, cj) to 
(r, 9) and replaces X2{k + 1, t,u) by x{t, 9). This rule turns on the ^'-machine Sg. 

Rg{r). This rule applies to an admissible word W when all state letters in W have (r, 9) in 
their labels, and W contains x{i,T,9), that is when Sg^r) ends its work. The rule 
removes from all letters, removes (r, 9) and indices from all state letters, i.e. this 
rule returns the state letters to their standard form. The meaning of this rule is 
that it turns off Sq{t) and gets our machine ready to simulating the next transition 
from 9. 
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We shall consider the S"- machines R^^r), i?4^a(r), Ra,u}{j)^ R^^q{t), Rgi^r) whose hard- 
ware is the same as the hardware of S{M) and whose only rules are, respectively, R^ir), 
i?4Q(r), Raujlr), Ri^gM, -Rgfr). We shall call these S-machines transition machines. 



Now let r have the form (|23D , 

In this case the simulation is much easier. It consists of just one S'-rule: 

P(r) =[F,,(1)^F,,(1),..., 

F(^)x(^)F,^(^)F'(^)p(^)g(^)r(^)s(^)^(^)p(^)g(^)f(^)s(^)^(^)F;(^) 

...,F,Sk)^h'S^)] 



For every configuration c = (FiMiF^^, E^u^Fq^) of the machine M let (j(c) be the 
following admissible word of S{M): 

F(0)a"x(0)F(0) 

E{l)u,x{l)Fq,{l)E\l)p{l)6\\-^S{l)r{l^^^^^ 
E{k)ukx{k)Fq^{k)E\k)p{k)5\\''*'\\q{k)r{k)s{k)t{^^^^ 

E'{k + l)x{k + lV"F'(fc + 1) 

where n = + ... + \uk\. Notice that |cr(c)| = 4|c| + 13k + 6. 

Conversely, if W is an admissible word for S{M), let n{W) be the word obtained by 
removing 

X, p, q, r, s, t, p, g, f, s, t, a, 6, F(0), F(0) 

and all E', F' with indices, replacing state letters by their standard forms (that is removing 
(r, j) from their labels), and reducing the resulting word. It is clear that if W is positive 
then niW) is a configuration of the machine M. It is clear also that fi{a{c)) = c. 

We shall call an admissible word W of S{M) normal if (using the notation in the 
definition of admissible words for S{M)) for every i from 1 to A; we have that UMifjH = 
1 1 Aj^i...Aj^io| I and if ni + n2 = n[ H-n'g = Z^iLi H^ii^^iH > 0. Recall that is the algebraic 
sum of the degrees of all letters in W. 

The following statement is obvious. 

Lemma 4.15 Let W be an admissible word for S{M). Then if W is positive and 
normal and one of the rules Riir), Rg^r)'^ , P{t) , r G 6 is applicable to W then W = a{c) 
for some configuration c of the machine M. 

If a transition r has the form (^2|): r = {Fg-^ Fgi^, aFg^ — F^^, Fg^ — > Fg^}, then 
the subword between an E(2)-letter and an F'(2)-letter in an admissible word of S{M) will 
be called the r-part of this word. The subword between an E'(z)-letter and an F'(2)-letter 
will be called the (r, (5)-part, the subword between an E(i)-letter and an F(i)-letter will 
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be called the (r, M)-part, the subword between an E(0)-letter and an F(0)-letter will be 
called the a-part, and the subword between an E'(A; + l)-letter and an F'{k + l)-letter 
will be called the tu-part. Notice that the work of S^It) affects only the {t,6) parts of 
admissible words and the work of Sg{T) affects only the r-parts, the work of Sair) affects 
only the a-parts and the work of S^^ affects only the w-parts. Therefore we can apply 



Lemmas [4.6| and |4.12| when we talk about S4{t) and Sg{T) even though the hardware of, 
say, Si^r) is "bigger" than the hardware of S^. 

Let Co be the accept configuration of the machine M (all tapes are empty, the indices 
of the F's form the accept vector sq). Then we can define the generalized time function 
d{n) and the area function a{n) of the 5'-machine S{M). For every n > 1 we denote 
by d{n) (resp. a{n)) the smallest number such that every admissible word W of length 
< n for which the set of computations CS{M)(W, o"(co)) is not empty, this set contains a 
computation of length (resp. area) < d{n) (resp. < a(n)). The function d{n) is called the 
generalized time function and the function a{n) is called the area function of the machine 
S{M). 

The union of the S-machines 

Ri,a (r) , 5„ (r) , Ra,u; (t) , S^{t) , R^^g^r) 

will be denoted by i?4 9(r). The hardware of i?4 9(r) is the same as for S{M). The 
following lemma is a corollary of Lemmas [4.13| and [4.14| . The machine i?4 9 is defined as 



a composition of submachines in the same manner as machines ^4 and ^9, so this lemma 
can be proved in a similar way as Lemmas ^4.6|, |4.12|, hence we omit the proof. 



Lemma 4.16 Let r be of the form Let W be a positive word to which the rule 

-R4,o(t) is applicable. Let W be obtained from W by first applying R^^aiT) and then 
replacing (r, a) in all state letters by (t, 9). Then 

1. Every computation o/i?4,9(r) starting with W or W is semiproper. The length of 
any such computation C is bounded by 0{\W\ + |V^|) and the area is bounded by 
0{{\W\ + where V is the last word in the computation. 

2. There is only one reduced computation C of R^^glr) starting with W and such that 
the rule Ruj,g{T)~^ is applicable to the last word in this computation. The last word 
in this computation is W . This computation contains two transitions such that the 
rule applied in one of these transitions contains a word Ex as one of its left sides, 
where E G E(0) and x G X(0), and the rule applied in the other transition has a 
word x'F' as one of its left sides, where x' G X(A; + 1), F' G F'{k + 1). 

3. Every computation of the machine R^^glr) starting with W and ending with a word 
to which the rule -R4,a(r) is applicable, is trivial. Every computation of R^^gir) 
starting with W and ending with a word to which Ruj.g{T)~^ is applicable, is trivial. 

The next proposition is the main statement of this section. In this Proposition, we 
included all the information about the ^'-machine S{M) that we'll use later. 
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Proposition 4.1 (S{M) simulates M.) Let M = {X,Y,Q,Q,si,so) be a Turing ma- 
chine satisfying the conditions of Lemma \3. j| . Then its time function, generalized time 
function and generalized space function are equivalent to each other, and its area function 
is equivalent to the square of the time function T{n). Let Wq = <j{cq) where cq is the 
accept configuration of the Turing machine M. Then: 

1. Every computation ofS{M) starting at Wq consists of normal words and is semiproper. 
If a rule R^^ai'T') is applied in this computation then the word W to which it applies 
is positive, the degree of a in this word is positive, and the result of the application 
of this rule is shorter than W . 

2. A configuration c of the machine M is acceptable by M if and only if S{M) can 
take (t(c) to Wq. 

3. Ifc is an acceptable configuration of M then there exists a one-to-one correspondence 
ip between all accepting computations of M starting with c and all computations of 
iS(M) connecting a{c) and Wq. This correspondence satisfies the following proper- 
ties: 

(a) For every accepting computation C of the machine M, all words in the com- 
putation ip{C') are positive. In particular, ip{C') is proper. 

(b) If the accepting computation C has length T and space S then ip{C') has length 
between eiS^ and e2TS^ and area between e^S'^ and e4,TS^ for some positive 
constants ei, 62, £3, €4. 

4. The generalized time function of S{M) is equivalent to T{n)^ and the area function 
of S{M) is equivalent to T{n)^. If there exists a computation of S{M) connecting W 
and Wq then the minimal area of such computations is < e^T'^{eQ\\W\\) + 0{\W\^) 
for some constants 65 and eg. 

5. Let C = (Wi, Wn), n > 3, be a reduced computation starting with a positive word 
Wi such that there exists a computation of S{M) connecting Wq and Wi. Suppose 
a rule Ri^aij) is applied in the transition Wi W2 and a rule RA^ai.'T') is applied in 
the transition Wn-i — > Wn. Then there exist 1 < i, j < n such that the rule applied 
in the transition Wi VFj+i contains a word Ex as one of its left sides, where 
E e E(0) and x G X(0) and the rule applied in the transition and Wj —>■ Wj^i has 
a word x'F' as one of its left sides, where x' G X(/i; + 1), F' G F'(A; + 1) . 

6. -R4,o(t)^^ are the only rules inS{M) which change the degrees of a anduo in admissi- 
ble words. The parts involving a and uo in this rule have the form xF a^^XiF and 
E'x' E'x\uj-^ where x,xi G X(0), F G F(0), E' G E'{k + 1), x',x[ G X(A; + 1). 

7. The rules involving a (resp. uo) are R^^ai^) md the rules of the form x — > a^^xa^^ 
where x G X(0) (resp. x' u^^x'u^^ where x' G X'(A; + 1)). For every x G 
X(0) U X(A; + 1) there exist at most two mutually inverse rules involving x and a 
or X and uo. 
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Proof. Let c be a configuration of tlie macliine M. Suppose tliat a computation C of 
tlie machine S{M) starts with o"(c). Notice that all rules of S{M) take normal words to 
normal words. Since a{c) is a normal word, every word in C is normal. 

The computation C can be represented in the form (|l]): 

Ci,C2,...,C;v (24) 
where each Ci is a computation of one of the machines: 

54(r),59(r),i?4,9(r),/?9(r),P(r') 
where r is any transition of M of the form (^), r' is any transition of M of the form 

(H). 

The submachine which executes the block Ci will be denoted by x{Ci)- Now we shall 
describe the sequence x{C) = (x(C'i), x(C'2)5 x(C'Ar)). 

By the definition of the transition machines it is clear that every non-trivial reduced 
computation of each of these machines contains only two words. 

Also from the definition it follows that 

a) the types of consecutive blocks Ci and Ci±i can be the following: 
i?4(r) and S4,{t), 
S/i{t) and R4q{t), 
R4,9{r) and Sq{t), 
Sg{T) and Rgir), 
Rq^t) and Ri^Ti), 
R^ij) and i?4(ri), 
Ri{T) and P(ri), 
Rg{T) and i?9(n), 
i?9(r) and P(ri). 

for some (different) r and ri, and 

b) x(C*) cannot contain subsequences where Zi, Z2, belong to the set 

{i?4(r),i?9(r),P(r) | r G 0}. 

Let us prove the following five statements by induction on i from 1 to A^. 

Al. if i < A^ and x(Cj) = S^ij) then the degrees of 5 in the (r, (5)-parts of all words in 
Ci are > and all words in Ci are positive. 

A2. if 2 < A^, x{Ci) = Sq{t) and Ci starts with a positive word then all words in Ci are 
positive. 
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A3, if i < — 2, or if i = — 1 and x{Ci) 7^ -R4,9(t), then all words in Ci are positive. 

A4. for every i, if x{Ci) = R^^gir) then the first word in Ci is positive; moreover the 
degree of a in the word W from Ci, to which -R4,a(r) is applicable, is positive, and 
the result of the application of this rule is shorter than W. 

A5. If 1< z < AT and xiC^) G {Si,Sg,R,,g} then xiQ) ^ xiQ+i). 



All five statements Al - A5 hold for i = 1 because of Lemmas [4.6| , [4.12| , [4.16 
Suppose that i < N and x{Ci) = S4. Then 

x(a+i),x(a-i)e{i?4(r),i?4,9(T)}. 

If = x{Ci+i) then Lemma ^]6| (ii) applies and by this lemma Ci is trivial, a 

contradiction. So xlQ-i) 7^ Then all conditions of Lemma apply to the 

(r, 5)-part of the first and the last words from Ci. Therefore by this lemma the degree of 
6 in the first and the last words of Ci are > 0, Cj is a proper computation and the degree 
of 6 in the (r, 5)-part does not change during Cj. Since the work of S^^t) affects only the 
(r, (5)-parts of admissible words, we deduce that all words in Ci are positive. This proves 
statement Al. 

Similarly suppose that = Sg{T). Then by Lemma |4.12| (ii), x(Ci_i) 7^ 



and the r-parts of the words from Ci are positive. This proves A2. 

Suppose that i < N — 2 and x(Cj) = R^^q^t). Then by Lemma |4.16| either the first rule 
applied in Ci is i?4,Q.(r) or the last rule applied in Ci is -R4,Q,(r)~^ (depending on whether 
x{Ci+i) is Sq{t) or x(Cj+i) is S^i^r)). In the second case all words in Ci are positive by 
Lemma [4.16| . 

Remark. Notice that in this case the degree of a in the last word in Ci is positive 
because -R4,a('r)~^ inserts a new a in the a-part of the word. Also notice that -R4,Q(r)~^ 
applied to a positive word always makes the word longer. 

Suppose that the first rule applied in Ci is i?4 Q(r). We have that i > 1 since x{Ci) G 
{i?4(r),i?9(r),P(r) | r G 6}. Clearly xiQ-i) = S,. If x(C.-i) = xiC^+^) then by 
Lemma [4.16| , Ci is trivial. Therefore = Sg. Since the statement Al has been 

proved already, the degree of 6 in the r-part of the first word of Ci is strictly positive. 
Therefore the rule -R4,a(r) does not insert a negative letter in the (r, (5)-part. 

We know that all words in C are normal. Since the first word in Ci is positive and 
normal, and the degree of S is greater than 0, the degree of a and the degree of uj in this 
word are also positive. Therefore the rule i?4^a does not insert a negative letter into the 
a-part and cu-part. 

Remark. Notice that in this case we showed that the degree of a in the word to 
which -R4,Q,(r) applies cannot be 0. Also since the degree of 6 in the (r, 5)-part of the first 
word of Ci is positive, the application of the rule R^^ai'^) makes the (r, 5)-part shorter by 
one letter. It also makes the a-part and the a;-part shorter by one letter. It can make the 
(r, y)-part longer by at most one letter and it does not touch other parts of the word. 
Therefore the resulting word is at least 2 letters shorter than the word to which this rule 
was applied. 
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By Lemma [4.16| the (r, (5)-part of the first word in Cj+i is positive. If the (r, M)-part 
of this word is not positive then it has the form 



E{i, T, 9)ua~^x{i, r, 9)F(z, r, 9) 

where w is a positive word (inherited from the first word in Ci) which does not end with 
a. 



But then Lemma |4.12| imphes that Cj+i cannot end with a word containing a subword 
stF (with indices), so Cj+2 does not exist which contradicts the inequahty i < N — 2. 
Therefore the r-part of the last word in Ci is positive. Since -R4,9(r) does not affect tape 
letters in other parts of admissible words, all words in Ci are positive. 

Suppose that i < N, x{Ci) ^ -R4,9(t) for any r, and Ci starts with a positive word. 

Then x{Ci) G {i?4, i?9, ^4, .Sg}. If 

X{C,)e{P{T),R,{T),R,{T)} 

then clearly all words in Ci are positive. If x{Ci) = S4 or x(Cj) = Sg then Al and A2 
imply that all words in Ci are positive. This proves A3. 

Suppose that x(C'i) = -R4,9(r). If i < — 2 then all words in Ci are positive by A3 
and the degree of a in this word is positive by the two remarks in the proof of A3. If 
i = N — 1 or i = N then x(Ci-i) 7^ -R4,9(t) and by A3, all words in Cj_i are positive. In 
particular, the first word of Ci (which is the same as the last word in Cj_i) is positive. If 
the rule -R4,Q(r) is applied at the beginning of Ci then x(Ci_i) = S^^r) and the degree 
of a in the first word of Ci is positive. If -R4,a('r)~^ is applied at the end of Ci then all 
words in Ci are positive and so the degree of a in the last word of Ci is positive (i?4 a(r)~^ 
inserts a new a in the alpha part of the word). This proves A4. 

Property A5 follows from A3 and Lemmas [4.6| , [4.12| , [4.16| . Indeed these lemmas imply 



that if x{Ci) e {S4,Sg,R4^g} and xiQ-i) = xiQ+i) then Ci is empty. 

Now let us prove the statements of the proposition. Notice that statements 6 and 7 
can be proved by a simple inspection of all rules of S{M), so we need to prove statements 
1-5 only. 

The fact that every word in a computation C starting at Wq is normal has been 
established before. If a rule -R4,a(r)^^ is applied in C then it is either the first rule or 
the last rule applied in a block Ci with x{Ci) = -R4,9(r). If this is the first rule then the 
word to which this rule is applied is positive by property A4. If this is the last rule then 
this rule is i?4^9(r)~^. In this case the first word in Ci is positive by property A4, and by 



Lemma [4.16| there exists exactly one computation of i?4 9(r) starting at the first word of 
Ci and ending with a word to which -R4,9(t) is applicable. In this computation all words 
are positive. Let us prove that C is semiproper. 

By statement A3, if x{Cn-i) 7^ -R4,9(t) then all words in \jf=i^ Ci are positive. In 
particular, the first word in Cjv is positive. Then Lemmas [4.6| , [4.12| , ^4.16| imply that the 



computation C is semiproper. 

If x{Cn-i) = -R4,9(t) then by statement A4 the first word in Cjy^i is positive. By 
statement A3, all words in [jfjl'^ Ci are positive. If the rule -R4,a(r)~^ is applied in Cn~i 
then the first word in Cat is positive and as before C is a semiproper computation. If 
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the rule R^^riT) is applied in Cn-i and inserts a negative letter in the r-part of 
the admissible word then x{Cn) = <S9{t), and by Lemma [4.12| the computation Cjy is 
semiproper and does not remove the letter a~^. So the computation C is semiproper. 
This proves statement 1 of the proposition. 

Notice that property A5 and the list of possible pairs (x(C*i), x(C'i+i)) imply that x{C) 
can be represented in the form Di(ri), D2{t2), D^^t^) where for each i < i, x{Di{Ti)) is 
a sequence of one of the following forms: 



(51) R4{ri),Si{Ti),R^^g{Ti),Sg{Ti),Rg{Ti) 

(52) R9{Ti),Sg{Ti),R4^g{Ti),S4{Ti),R4{Ti] 

(53) P(r,), 



and x{D() is a prefix of one of these sequences. 

Since all words in C are normal, by Lemma ^.15| , there exists a sequence of configura- 



tions C = (ci, ...,q) such that the first word in Di{Ti) is o"(cj), i = 1, Notice that if 
we apply one of the rules of the submachines R/i{T),Si{T),Sg{T) and Rgir) to a positive 
admissible word W then the corresponding configuration ^iW) of the machine M will 
not change. If W is positive and the result W of an application of -R4,Q,(r) is also positive 
then ^{W') is obtained from ^{W) by applying the transition r. Therefore in C every q 
is obtained from Ci_i by applying a transition from G. Thus C is a computation of the 
Turing machine M. 

If c is an acceptable configuration of the machine M then there exists a computation C 
of M from c to Cq. Let T1T2...T1 be the history of this computation. Then the corresponding 
computation Diiji), D2{t2), D^ijii) takes cr(c) to o"(co). Conversely, if there exists a 
computation C which takes a{c) to cr(co) then as we proved before, C = Di{Ti)...Dk{Te) 
where each Di, i < i has one of the forms (SI), (S2), (S3) and Di is a prefix of such a 
sequence. Since the last word in Di is o"(co), Dg cannot be a proper prefix. As we proved 
before, the word tiT2...T£ is the history of a computation which takes c to Cq. This proves 
statement 2 of the proposition. 

Let r be a positive transition from G of the form ( p2D and suppose that a configuration 
c' is obtained from c by applying r. Then Lemmas |4.6| , |4.12| , [4.16| imply that there exists 



only one computation of S{M) of the form (SI) which takes a{c) to cr(c') and only 
one computation of the form (5*2) which takes cr(c') to cr(c). If r has the form ( p3|) 
and c' is obtained from c by applying r then cr(c') is obtained from ct(c) by applying 
P(r). Therefore the correspondence between computations of M which take c to cq, 
and computations of S{M) which take a{c) to cr(co), is one-to-one. Let us call this 
correspondence ip as in the proposition. 

The first property of ip has been established before. 

In order to establish the second property, let C = (ci, ct) be an accepting compu- 
tation of the machine M of length T and space S. Let C = iplC). As before ip{C) can 
be represented as a sequence of blocks Di{ti), D2{t2), ...,Dt{tt). By Lemmas ^4.12| 
the length of each block Di{Ti), i = I, T, in this representation is 0(nj -|- nf) = 0{n}) 
where Ui is the length of the Tj-part of the first word in Di{Ti), i = 1,...,T. Since all 
words in the computation C are normal, the numbers rii do not exceed 0{S). Therefore 
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the length of each block Di does not exceed 0(5"^). Thus the length of C does not exceed 
0{TS'^). Similarly the area of C does not exceed 0{TS^) because the area of each block 
does not exceed S^. 

Let d be a configuration in C with maximal possible length. By definition of the 
space of a computation, this length must be equal to S. 

Then there exists a number i such that the length of the word written on tape i in 
the configuration c' is at least S/k where k is the number of tapes of the machine M. 

Let Ti...tt be the history of the computation C . Since the computation C is accepting, 
the length of the word written on tape i in the last configuration of C is (see Lemma 



3TT| , statement 4). Since every transition of the machine M can remove at most one letter 
from the word written on tape z, there exists at least S/{2k) — 1 transitions Tj^, Tj^ in 
the history of computation C such that the length of the word written on the tape i in 
configuration Cj^, n = is at least S/{2k), and the transition tj^ removes a letter 

from this word. By Lemmas |4.(j| , [4.12| and [4.16| the length of each of the corresponding 



blocks Dj^ijj^) exceeds 0(S'^/(4A;^)) and the area exceeds 0{S^ / {Sk^)). Therefore the 
total length of the computation ip{C') exceeds 0{S^) and the area exceeds 0(5'^). This 
gives us the second property of ip. 

Let us prove the fourth statement of the proposition. Let d{n) be the generalized time 
function of iS(M), and let a{n) be the area function of S{M). Let S{n) be the generalized 
space function, and let T{n) be the time function of the machine M. We know that 
S{n) is equivalent to T{n). Since every accepting computation ends with all tapes empty, 
T{n) > n. It is also clear that S{n) > n. Thus T{nY is equivalent to S{nY for every 
natural number p > 0. 

Fix a number n > 0. Let c be an accepted configuration of the machine M with 
|c| < n, such that the smallest space of an accepting computation for c is S{n). Then by 
the second property of the function ■?/', the shortest computation of S{M) connecting a{c) 
with Wq has length > eiS{nY and the smallest area computation connecting Wq and cr(c) 
has area > e^S^in) for some constants ei, 63 > 0. Since \o'{c)\ = 4|c| + 13k + 6, we have 

d(4n + 13A; + 6) > eiS^in), a(4n + I3k + 6) > esS^n) 

Since the function S{n) is equivalent to T{n), we have 

T{nf ^ din), T{nf ^ a{n) (25) 

On the other hand take any admissible word W, \\W\\ < n such that there exists a 
computation of S{M) connecting W to Wq. Take the shortest computation Ci and the 
smallest area computation C2 which connect Wq and W. We can represent Ci, i = 1,2, in 
the form %l){C[)C'l. Notice that the length of the last configuration q in C[ is smaller than 
Indeed, recall that is a composition of computations of S^ir), S'^ir), Ra^t), 
i?4,9(r), Rgi^r) for some transition r. Therefore either fi{W) coincides with q, or fi{W) 
is a configuration of M obtained from Cj by applying a command r from 6, or fi{M) is 
obtained from Cj by inserting a negative letter in one of the tapes. Thus |cj| < |yu(W^)| + 1. 
On the other hand \fJ.{W)\ < {\\W\\ - 13k - 6)/6 < \W\ -1. So \ci\ < \\W\\. 

The computation C'^, i = 1,2, is an accepting computation for the configuration q. 
Since \ci\ < n, there exists an accepting computation Ci for q of length T < T{n). The 
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space of an accepting computation of length T cannot exceed T (because in the last 
configuration all tapes are empty and every step of a computation can remove at most 
one cell). By the second part of our proposition, the length of ip{Ci) does not exceed 
e2T^ < e2T(n)^ and the area of this computation does not exceed e^T'^ < e4T(n)*^. for 



some constants €2, €4. By Lemmas [4.6|, |4.12|, |4.16| the length of Cf , i = 1,2, does not 



exceed 0(|iy | + \W\y) and the area does not exceed 0((|W| + |W^|)^) where W = a(cj). 
Therefore the length of C", i = 1,2, does not exceed 0(|iyp) and the area does not 
exceed 0(|iyp). The computation ip{Ci)C'{ takes Wq to W. So it cannot be shorter than 
the computation Ci. Therefore the length of Ci does not exceed 0(T^( | |)) + 0(|iyp). 
Similarly, the area of C2 does not exceed the area of ip{C2)C2- So the area of C2 does not 
exceed 0{T{n)^) + 0{\W\^). This proves the second part of statement 4 of the proposition. 
Since \ \W\ \ < \W\ and T{n) > n, we deduce 

d{n) ^ T{nf, a{n) < T{n)\ (26) 

The inequalities ( P5| ) and ( P^D imply that d{n) is equivalent to T{n)^ and a{n) is equivalent 
to T{nY. This completes the proof of statement 4 of the proposition. 

Let us prove property 5 of the proposition. Let C = (Wi, ...,Wn), n > 3, be a 
reduced computation of S{M) starting with a positive word Wi such that there exists a 
computation C of S{M) connecting Wq and Wi. Suppose a rule -R4,a(r) is applied in the 
transition Wi — > W2 and a rule -R4,a(r') is applied in the transition Wn-i —>■ Wn- Let 
C = CC and C = C"C~^ for some computations C, C, C" where C" is a maximal prefix 
of the computation of C which cancels when we reduce the computation CC . Then C"C 
and C"C' are two reduced computations starting with Wq. 

Suppose first that C is not empty. 

Then as before we can decompose the computations C"C~^ and C'C into a sequence 
of blocks (p^ . Since the rule R^^aij) is applied in the transition Wi W2, the block B 
of the computation C'C containing Wi is such that x{B) = R^ gi^r). Similarly Wn-i, Wn 
are contained in a block B' of the computation C'C~^ such that x{B') = R^^gir'). Since 
there is only one rule from i?4 9(r') applicable to Wn-i, the block B' has length 2. 

If B is contained in C~^ then by Lemma |4.16| , C contains two transitions Wi —>■ Wi+i 
and Wj —>■ Wj+i such that the corresponding rules have Ex and x'F' as their left sides 
where E e E(0) and x G X(0), x' G X(A; + 1), F' e F'{k + 1) as desired. 

Suppose that C*~^ is not empty and is contained in B. Let Ti, T2 be the first two words 
of the block B. Let B" be the block of the computation C'C containing Ti and T2. Since 
the rule applied in the transition Ti — > T2 belongs to i?49(r), x{B") = R/i^q{t). Since the 
block B' has length 2, the block B" is not the last block in the computation C'C. By 
property A3, Ti is a positive word. Therefore B" must end with an application of the rule 
i?4^Q,(r)^^. By Lemma [4.16| , there is only one reduced computation of the machine Ri^Q^r) 



which starts with Ti and ends with a word to which -R4,q(t) is applicable. Thus B = B". 
But this contradicts the assumption that C is the longest part of C which cancels in the 
product CC. 

Finally suppose that C is empty, that is, the computation is reduced. Then the 
words Wi and W2 are contained in a block B of the computation CC and x{B) = R^^g^r). 
As before this is not the last block in the computation CC because Wn-i must belong to 
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another block. Therefore we can apply Lemma ^.16 
desired transitions. 

This completes the proof of the proposition. □. 



and conclude that B contains two 



5 The Group Presentation 

In this section we convert the S"- machine S{M) into a group presentation. 



Let S = S{M) be the S*- machine described in Proposition [4.1| . Let Y be the vector 
of sets of tape letters, and let Q be the vector of sets of state letters of S. The vector Q 
has 15k + 6 components which we shall denote by Qi, Qi^k+e- Notice that Qi = E(0), 
Q2 = X(0), Q3 = F(0), Qi5fc+4 = E'(fc + 1), Qi5fe+5 = X(fc + 1), Qi5fc+6 = F'(fc + 1). 

Let Wq be the same admissible word as in Proposition We shall not use the whole 
definition of S, only Proposition O. 



Let G be the set of rules of S. Let us call one of each pair of mutually inverse rules 
from O positive and the other one negative. The set of all positive rules will be denoted 
by 6+ and the set of all negative rules will be denoted by 9_. 

Let N be any positive integer. Let 

15k+6 k 

A= [j Q^U{a,iu,S}u\JYiU{KJ\ J = l,...,2N}ue+. 

1=1 i=l 

Our group Gj\f{S) is generated by the set A subject to the set VNi<S) of relations 
described below. 

1. Transition relations. These relations correspond to elements of 0+. 

Let r e 0+, T = [Ui Vi, Up — ^ Vp]. Then we include relations Ul = Vi, = 
Vp into Vn{S). Here x"^ stands for y~^xy. If for some j from 1 to 15k + 6 the letters 
from Qj do not appear in any of the Ui then also include the relations gj = qj for every 

Qj e Qj- 

2. Auxiliary relations. 

These are all possible relations of the form tx = xr where x E {a,uj,5} U Ui=i^i; 
r G 0+ and all relations of the form tKi = /tjT, i = 1, 2N, t G 0+. 

3. The hub relation. 

For every word u let K{u) denote the following word: 

K(u) = {u~^ KiUK2U~^ . . . U~^K2N-lUK2N)if^2NU~^l^2N-lU ■ ■ ■ K2U^^ Kiu)^^ . (27) 

Then the hub relation is 

K{Wo) = 1. 

Figure 1 shows the diagram corresponding to the hub relation if = 1. 
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Fig. 1. 



We define the presentation Vn{<S) by taking the set of all the relations defined above, 
their cyclic shifts and inverses of these cyclic shifts. We shall say that a relation is 
determinedhj some characteristic (a subword, a pair of letters, etc.) if it is determined up 
to taking cyclic shifts and inverses. Cyclic shifts and inverses of transition (resp. auxiliary 
and hub) relations will also be called transition (resp. auxiliary and hub) relations. 

The group G!\f{S) with which we will be working in this paper is given by the presen- 
tation Vn{S). 

The following lemma contains some properties of Vn{S) which either follow from the 
definition of S'-machines or are easy to verify. 

Lemma 5.1 a) Every relation except the huh contains exactly two Q^^-letters: a letter 
from and a letter from 0^^. 

b) If transition relations si,S2 € Vn{<S) have a common Q-letter and a common Qj- 
letter then si is a cyclic shift of sf^. 

c) If two auxiliary relations Si, S2 G Vn{S) have a common 2-letter prefix then Si = S2- 

d) If Si and s^^ are two cyclic shifts of the hub or its inverse and have a common 
prefix of the form KjWoKs then si = S2- 



6 Some Basic Definitions 



We mainly use notation and definitions from Ol'shanskii |^. In particular, we use the so 
called 0-refinement of a van Kampen diagram. We do not want to define it here precisely 
recall only that a 0-edge is an edge labelled by 1, and a zero-cell corresponds to a relation 
of the form aV^a~^V^ where a is one of the generators, m and n are integers. One can 
insert zero-edges and zero-cells in a diagram in order to separate two paths which touch 
each other, or in order to make a path, which touches itself, simple. When we compute 
an area or diameter of a diagram we never take zero-cells and zero-edges into account. 

If A is an (ordinary) van Kampen diagram then i9(A) denotes its boundary. If A is 
an annular diagram then (9o(A) and di{A) denote the outer and the inner boundaries of 
A. The cells of diagrams over Vn{S) will be called by the names of the corresponding 
relations: transition cells, auxiliary cells, hub cells (or simply hubs). 

We always assume that boundaries of van Kampen diagrams and boundaries of cells 
are oriented clockwise (this is an insignificant difference with |2^). The contour oi a cell 
or a diagram is the union of the boundary and and its inverse. 
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A path in a diagram A is called simple if it does not cross itself. A path is called reduced 
if it does not contain consecutive mutually inverse edges. Every path in a diagram can 
be reduced by removing subpaths of the form ee~^. It is mentioned in |Q that by a 
0-refinement one can turn every simple path into an absolutely simple path which does 
not cross and does not touch itself. 

The length of a path p, denoted by \p\ is the number of non-zero edges in it. 

We shall use the following operations which can be performed on arbitrary diagrams. 

Taking the inverse (mirror image). Let A be a van Kampen diagram over a 
symmetric set of defining relations V. Consider the mirror image A~^ of the graph A 
with respect to some straight line. Since V is symmetric, the graph A^^ is again a van 
Kampen diagram over V. We call A~^ the inverse of A. It is easy to see that every two 
inverses of the same diagram A can be transformed into each other by a homotopy of a 
plane, so they are homotopic. 

Composition. Let Ai and A2 be van Kampen diagrams over a presentation V. Let 
d{Ai) = pip[, d{A2) = P2^P2- Suppose that Lab(pi) = Lab(p2) in the free group. Here 
Lab(p) is the label of the path p. Then by a zero refinement we can make both paths pi 
and P2 absolutely simple and the labels of pi and p2 identical. After that we can glue Ai 
and A2 by identifying the corresponding edges of pi and p2- This operation will be called 
the composition and the resulting diagram will be denoted by Ai Opi=p2 ^2- 

Let A be any van Kampen diagram. The following definition is similar to the definition 



of a dual graph of a diagram 0], p3[. Fix a point in each of the cells in A and a point 
in the inside of each of the edges of A. For each cell tt and each edge e on the boundary 
of this cell, fix a simple polygonal line e) inside tTj which connects the point inside vTj 
and the point inside e. We can choose these lines in such a way that £{71, e) and i{n, e') 
do not have common points except for the fixed point inside vr. 

The next definition of a band in a diagram is crucial for our paper. 

Let S* be a set of letters and let A be a van Kampen diagram. Fix pairs of S-edges in 
some cells from A (we assume that each of these cells contains at least two S-edges). 

Suppose that A contains a sequence of cells (tti, . . . , 7r„) such that for each i = 
1, . . . ,n — 1 the cells vTj and vTj+i have a common S'-edge and this edge belongs to the 
pair of S-edges fixed in vTj and vTj+i. Consider the line which is the union of the lines 
i{ni, Ci) and ^(vrj+i, ei), i = 1, . . . ,n — 1. This polygonal line is called the median of this 
sequence of cells. Then our sequence of cells (tti, . . . , 7r„) with common edges ei, . . . , e„_i 
is called an S-band if the median is a simple curve or a simple closed curve. 

We say that two bands cross if their medians cross. We say that two bands touch each 
other if their medians touch each other. We say that a band is an annulus if its median 
is a closed curve. 

Let B be an S-band with common edges ei, 62, • • • , e„ which is not an annulus. Then 
the first cell has an S'-edge e which forms a pair with ei and the last cell of B has an edge 
/ which forms a pair with e„ . Then we shall say that e is the start edge of B and / is the 
end edge of B. If p is a path in A then we shall say that a band starts (ends) on the path 
p if e (resp. /) belongs to p. 

If S and T are two disjoint sets of letters, (vr, tti, . . . , 7r„, tt') is an S-band and (vr, 71, 
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. . . , 7m, tt') is a T-band then we say that these two bands form an {S, T)-annulus if the 
medians of these bands form a simple closed curve called the median of the {S, T)-annulus; 
and in addition the start and end edges of these bands are not contained in the polygon 
bounded by this median. 

If £ is the median of an 5'-annulus or an {S, T)-annulus then the maximal subdiagram 
of A contained in the area bounded by i is called the inside diagram of the annulus. 

The union of cells of B forms a subdiagram (perhaps after some 0-refinement). The 
reduced boundary of this diagram, which we shall call the boundary of the band, has the 
form e^^p/^^g^^ (recall that we trace boundaries of diagrams clockwise). Then we say 
that p is the top path of B, denoted by top{B), and q is the bottom path of B, denoted by 
bot(B). 

Suppose that an S'-band i5 is a union of two S'-bands Bi and B2 and the end edge 
of Bi coincides with the start edge of B2. Then the top (bottom) path of B is obtained 
by multiplying the top (bottom) paths of Bi and B2 and reducing consecutive mutually 
inverse edges. 

li B — (tti, . . . , TTn) is an 5'-band then (tt^, 7r„_i, . . . , tti) is also an S'-band which we 
shall call the inverse of B. We shall denote this band by B~^. 

We shall call an S'-band maximal if it is not contained in any other S-band. If an 
S-band W starts on the contour of a cell tt, does not contain tt and is not contained in 
any other S-band with these properties then we call W a maximal S-hand starting on the 
contour of n. 

Now let us return to our presentation Vn{S). Consider the following partition of the 
generating set A: 

e U Qi U ... U Qi5fc+6 U U U U {k2n}- (28) 

Notice that these subsets are disjoint (the fact that Qi, •••,Qi5A:+6 are disjoint follows 
from the definition of S-machines) . 

Let S be a block of this partition. In order to consider S-bands and annuli, we need 
to divide i?-letters of some relation of Vn{S) into pairs. 

Every relation except for the hub contains a letter from G and the inverse of this letter. 
These letters form a ©-pair. 

Let us denote {a,uj,5} U U^i by Y. Every auxiliary relation containing a F-letter, 
contains a F-letter and the inverse of this letter. These letters form a F-pair. 

Every auxiliary relation containing Ki contains also These two letters form a 

Kj-pair. 

If a transition relation has a letter from Qj then it has exactly two letters from Qf^ 
(this follows from the definition of an S-rule). They form a Qj-pair. 

The hub relation contains an occurrence of Ki and an occurrence oi k~^. These occur- 
rences form a Kj-pair in the hub. 

A Kj-band is called even (resp. odd) if j is even (resp. odd). The number j will be 
called the index of this ^t-band. 
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7 Forbidden Bands and Annuli 



In this section we shall consider van Kampen diagrams with a fixed boundary label over 
the presentation Vn{S). Let A be such a diagram. 

For every j = 1, . . . ,2N we define an automorphism of the group G'j^{S) given by 
the presentation Vn{S) without the hub. Take a perfect 4A^-gon with boundary labelled 
by the word K{1) = {i^i...H2n){K'2N ---i^i)'^ (see formula (p^) for the definition of K{u)). 
Consider the refiection with respect to the axis passing through the mid points of the 
opposite edges labelled by nf^. Then this refiection induces a permutation (pj on the set 

{k1,...,K2Jv}U{K];\...,K2^} 

which satisfies the property (l)j{K^^) = (pj^Ki)'^. We extend to a permutation of the 
set A U A^^ by fixing all other letters from A U A~^. It is easy to see that (f)j takes every 
relation from Vn{S), except for the hub, to a non-hub relation from Vn{<S)- Therefore 
these maps induce automorphisms of G'j^{S) which will be also denoted by (pj. One notices 
also that these automorphisms are involutions (each one is its own inverse). 

Lemma 7.1 If A contains a Hj-annulus then there exists a van Kampen diagram with 
the same boundary label as A, smaller area and the same or smaller diameter. 

Proof. Consider each K^-cell as a Kj-band. Then it is easy to verify (see the list of 
relations in the presentation) that the label of the top and the label of the bottom paths 
of this band are images of each other under the automorphism (pj of the free group. 
Therefore the boundaries of the annular diagram formed by any Kj-annulus JC have labels 
one of which is the image of another one under (pj in the free group (when we formed the 
boundaries of the annular diagram, we reduce pairs of consecutive mutually inverse edges 
and this does not change the values of the labels of these boundaries in the free group). 
We can assume that JC is an innermost Kj-annulus, that is, there are no fcj-annuli inside 
the subdiagram A' bounded by the median of /C. Since the contour of /C contains no 
Kj-edges, the subdiagram A' does not contain Kj-edges (otherwise the maximal /Cj-band 
containing this edge would have to be an annulus). This implies that A' does not contain 
hubs. That is. A' is a van Kampen diagram over the presentation of G'^{S) (see Figure 
2). 
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Fig. 2. 

Let us apply (f)j to all labels of A'. Since takes relations of G'j^{S) to relations of this 
group, the subdiagram A' turns into another van Kampen diagram 0j(A') over 'Pjv(«5). 
We can also notice that the labels of the inner and the outer contours of our annulus are 
now the same in the free group. By applying a 0-rcfincmcnt one can make these labels 
graphically equal. After that one can identify these contours and remove all the cells of 
the annulus from the diagram. The result will be a van Kampen diagram E with the 
same boundary label as A and smaller area. 

It is also easy to show that the diameter of the diagram can only decrease when we 
remove a K-annulus. Indeed, with every path p in A one can associate a path p' in E 
conecting the same vertices. This path is obtained by removing some edges (the common 
edges of the removed annulus) and identifying the end vertices of these edges, and by 
relabeling the parts of p which are contained in the subdiagram bounded by the annulus. 
The length of p' can be only smaller than the length of p. Therefore the diameter of E 
can be only smaller than the diameter of A. □ 

Now we can define reduced van Kampen diagram over Vn . 
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Definition 7.1 A van Kampen diagram over a presentation V is reduced if it does not 
contain any of the following: 

1. A 0-edge (that is an edge labelled by 1). 

2. A reducible pair of cells (see Figure 3): 




Fig. 3. 

Here v is not empty. 
3. A Kj-annulus (for any i). 



Lemma |7.1| shows that if A has a K-annulus then its area can be decreased without 
increasing the diameter. It is easy to see that if A contains a reducible pair of cells, its 
area can decrease too: both cells can be removed by removing the path labelled by u 
and identifying the paths labelled by v. Unfortunately the diameter of the diagram can 
increase. This does not cause any difficulty in computing an upper bound for the smallest 
isodiametric function, but this makes it more difficult to find a good lower bound (see 
Section [T^) . 

For each word w which is equal to 1 in our group Gn{S) there exists a reduced diagram 
with boundary label w. Indeed, it is enough to take the van Kampen diagram with 
minimal number of cells and boundary label w (such a diagram exists by van Kampen's 
lemma) . 

Definition 7.2 An annular van Kampen diagram ^ over Vn{S) will be called reduced 
if any ordinary van Kampen diagram obtained by cutting ^! along a simple path connecting 
9o(\E') with (9j(\&) is reduced. 

Notice that if we obtain an annular diagram \I' by removing a subdiagram from a 
reduced van Kampen Diagram A then ^ is reduced. 

For the rest of this section we shall assume that A is a reduced diagram over Vn{S). 

Lemma 7.2 //A is reduced and does not have hubs then it does not have Qj-annuli 
for any j = 1, 15k + 6. 

Proof. Suppose there is a Qj-annulus. Let Q be the Qj-annulus with the smallest 
inside diagram A'. Since Q-annuli cannot cross (this follows from the definition of an 
S'-rule), A' does not contain Q-cells (otherwise we would get a Q-annulus with smaller 
inside diagram). Therefore all cells in A' are auxiliary. Since every cell in Q has exactly 
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two Qi-edges, the boundary d{A') contains no Q-edges. Thus A' is a diagram over the 
presentation consisting of all auxiliary (commutativity) relations. The group H given by 
this presentation has a homomorphism onto the direct product of the free group on G and 
the free group on F U {a, a;, ki, . . . , K2n}- Therefore the word u = Lab(c}(A')) is equal to 
1 in this direct product. Every Q-cell in Q, considered as a Q-band, contains exactly one 
9-edge on the top path (resp. on the bottom path). If two of these 9-edges cancel when 
we form the top/bottom path of Q, then the corresponding cells of Q cancel, by Lemma 
|5.1| (b). This cannot happen since A is reduced. Therefore u contains a B-letter. 

This implies that u contains a subword of the form t^^vt"^^ for some letter r G 0+ 
and some word v which does not contain G-letters. Then the G-letters in this subword 
must label edges of consecutive cells vr and n' in Q (again we use the fact that 9-edges 
do not cancel when we form the top/bottom path of Q). Then the cells vr and vr' cancel. 
Indeed by Lemma (b) the relation si corresponding to the cell tti is a cyclic shift of 
sf^ where S2 is the relation corresponding to 712- We can assume that we start reading 
Si and S2 at the beginning of their common Qj-edge. Since the 9+-edges on the bottoms 
of these cells are oriented toward each other, we conclude that Si = and each of the 
words Si and S2 contains only two 9-edges, Si = 82^. So the cells tti and 7^2 cancel. 

This again contradicts the assumption that A is reduced. □ 

Lemma 7.3 If A is reduced, does not have hubs and d{A) consists of Q -edges then A 
does not have cells. 



Proof. Indeed, by Lemmas |7]2| and A has no Qj-annuli. Thus it cannot contain 
Qj-cells. Therefore all cells in A are auxiliary. Hence, as in the proof of Lemma 7^ , the 
boundary label of A is equal to 1 in the free group generated by 9. Since A is reduced, 
it does not contain cells. □ 



Lemma 7.4 If A is reduced and does not contain hubs, then it does not have [Qj, 9)- 
annuli. 



Proof. Suppose that A contains a {Qj, 9)-annulus Wi U W2 where Wi is a Qj-band 
and VV2 is a 9-band. Let A' be the inside diagram of this annulus. We can assume that 
A' has the smallest possible area. 

Then the contour of A' does not contain Q-edges (otherwise there would be a {Qi, R)- 
annulus with a smaller inside diagram). Therefore A' does not contain any transition cells 
(by Lemma |7.2| there are no Q-annuli). So it can only contain auxiliary cells. Thus A' 
is a diagram over the group H given by the auxiliary relations only. We know from the 
proof of Lemma [T^ that this group has a homomorphism onto the direct product of the 
free group over 9 and the free group generated by other letters (not from U Qi U 9). Thus 
if we remove all non-9-letters from the label w' of the boundary of A', we get a word 
which is equal to 1 in the free group generated by 9. Therefore w' contains a subword 
t = T^^vT^^ as in Lemma \l.2\ or else the contour of A' does not have 9-edges at all. 

Suppose first that the contour of A' contains 9-edges. The part of the contour of A' 
which is contained in the contour of W2 cannot contain 9-edges because the contour of 
each 9-cell contains exactly two 9-edges which form an opposing pair. Thus both r's in 
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the subword t must be labels of some edges on the part of d{A') which is contained in 
Wi. These two r's must belong to two neighboring transition cells, so A is not reduced 
(we apply the same argument as in Lemma |7.2|) . 

Now suppose that d{A') does not contain 0-edges (see Figure 4). 




Fig. 4. 



Then Wi contains just 2 cells, the intersection cells of Wi and W2- Then the edges 
labelled by r and r' on Fig. 4 belong to W2. The r's in every B-cell of W2 are the same. 
Therefore r = r'. This implies that the two Q-cells in Wi form a reducible pair (Lemma 
|5.1| b) ), a contradiction. □ 

Lemma 7.5 Suppose that (not necessarily reduced) A contains a {K,j,Q)-annulus A 
consisting of a Kj-band Wi and a Q-band VV2. Suppose that Wi does not have hubs. Then 
Wi contains a reducible pair of cells which belong to the same Q-band. 

Proof. If there is a Kj-cell in Wi between the intersection cells pi and p2 of Wi and 
VV2 then this cell will belong to a 0-band which forms a {kj, G)-band with a part of Wi 
and the inside diagram of this annulus would be smaller than the inside diagram of A. 
Therefore we can assume that the cells pi and p2 are consecutive cells on Wi. The start 
and the end edges of W2 form an opposing pair with the same label. Therefore the cells 
pi and P2 correspond to mutually inverse relations and have a common Kj-edge. Thus 
these cells form a reducible pair and belong to the same 0-band. □ 

Lemma 7.6 If A is reduced and does not contain hubs then it does not contain B- 
annuli. 

Proof. Take an innermost G-annulus B. Let A' be the inside subdiagram formed by 
this annulus. Then A' does not contain G-annuli. Since the contour of A' contains no 
6-edges, the diagram A' contains no B-edges either. Since the only cells that contain 
no 0-edges are the hubs (which were ruled out here). A' contains no cells. Therefore 
Lab{d{A')) = 1 in the free group. Since A contains no (Q, B)-annuli (Lemma |7.4|) , 
d{A') contains no Q-edges. Therefore d{A) contains only (F)-edges and all cells of B are 
auxiliary. Since Lab(5(A')) = 1 there are two neighboring edges in d{A') with mutually 
inverse edges. The cells of B containing these two edges form a reducible pair, by Lemma 
|5ri| (c). This contradicts the assumption that A is reduced. □ 
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Lemma 7.7 If A is reduced, does not contain hubs and <9(A) has no Q-edges then A 
does not have cells. 



Proof. Since A contains no B-annuli (Lemma |7.6| ), the assumptions of the Lemma 
imply that A contains no 6-edges at all. Since every relation in Vn{S) except for the 
hub contains letters from G, the diagram A has no cells. □ 

Lemma 7.8 If A is reduced and contains no Q-edges then it does not have (F, O)- 
annuli. 

Proof. Since A contains no Q-edges, all cells in A are auxiliary. Take an innermost 
{Y, 0)-annulus. Let A' be the inside diagram of this annulus. Then d{A') = pq where 
Lab(p) contains no 9-edges and Lab(g) contains no F-edges (p is the bottom or the top 
path of a 0-band, q is the bottom or the top path of a F-band). It follows from the 
structure of auxiliary F-cells that the top and the bottom paths of every non-empty Y- 
band must contain G-edges. Therefore either q contains G-edges or q is empty. The first 
case is impossible because our [Y, G)-annulus was innermost. Therefore q is empty. Then 
d{A') contains no G-edges. By Lemma \l.7[ A' contains no cells. Therefore Lab{d{A')) = 1 
in the free group. This implies, as before, that our [Y, G)-annulus is not reduced. □ 

Lemma 7.9 If A is reduced and does not have hubs then it does not have Y-annuli. 

Proof. Suppose that A has a F-annulus. Take the innermost F-annulus 3^. Then the 
outer contour p of this annulus cannot contain Q-edges since a F-band cannot contain 
transition cells (by the definition of F-bands). Lemma implies that the subdiagram 
bounded by p does not contain Q-edges. It also cannot contain k- or G-edges because of 



Lemmas u^M and 7.6. Thus the diagram bounded by p must be empty, a contradiction 



(this diagram contains y). □ 



8 Diagrams Without Hubs 

In this section, we estimate the area and diameter of a diagram without hubs over the 
presentation Vn{S) and also consider the process of reducing a non-reduced diagram 
without hubs. 

With every diagram A over Vn{<S) we associate two numbers: n{A) is the length of 
the boundary of A and h{A) is the number of hubs in A. 

Lemma 8.1 Let A be a reduced diagram without hubs over the presentation Vn{<S), 
with perimeter n = n{A) and with m maximal Q-bands. Then m < n and the area of A 
is at most Cri^m where C is a constant. Therefore the area of A does not exceed Cn^. 
The diameter of A does not exceed C'n for some constant C . 

Proof. Every cell in A belongs to a maximal G-band. Since A contains no G-annuli 
(Lemma |7.6|) , each of these G-bands starts and ends on the contour of A. Therefore there 
are at most n maximal G-bands. A similar reasoning shows that m <n. 
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Every transition cell is an intersection of a G-band and a Qj-hand. Since A does not 
contain {Qj, 0)-annuli (Lemma [7. 41) , each G-band intersects each Qj-band at most once. 
Therefore A contains at most mn/4 transition cells (the total number of intersections is 
bounded by mn but each transition cell is counted at least 4 times because it belongs to 
two mutually inverse Qj-bands and to two mutually inverse 0-bands). 

Let $ be the set of all maximal F-bands. Every non-transition cell with F-edges in A 
belongs to two mutually inverse bands in $. Therefore we need to compute the number 
of bands in $ and their lengths. 

Since A does not contain F-annuli (Lemma |7.9| ), each of the F-bands in $ starts 
either on the contour of A or on the contour of one of the transition cells. The number 
of F-edges on a transition cell is bounded by a constant. Therefore the number of F- 
bands in $ is at most Cmn for some constant C. Every cell in each of the bands in $ 



is an intersection of a y-band and a 0-band. By Lemma 7.8, each of the F-bands in 



$ intersects each of the G-bands in A at most once (we use the fact that F-bands in $ 



do not contain transition cells, and Lemma [7.4|) . So the length of each of the bands in 



$ does not exceed n/2. Therefore the number of non-transition F-cells in A does not 
exceed Cimn^ for some constant Ci. 

Each non-transition cell which does not contain F-edges is the intersection of a G- 
band and a /tj-band. Since A does not contain K-annuli, the total number of maximal 



K-bands (including inverses) is at most n. By Lemmas |7.5| each G-band intersects each of 
the K-band at most once. Therefore the number of non-transition cells without F-edges 
in A does not exceed n^/4. 

Thus the total number of cells in A does not exceed Cimn^ + n^/4 < C2mn^. 

In order to estimate the diameter of A pick a vertex v in A. Let d be the distance 
from V to the boundary of A. It is clear that n/2 plus the maximal among these (i's for all 
vertices v is the upper bound for the diameter of A, so we need to estimate d. Without 
loss of generality we can assume that v is not on the boundary of A. The vertex v belongs 
either to a K-cell, or to a F-cell. In the first case v belongs to the boundary of a K-band 
in A. Since the length of this K-band is at most n, we conclude that d <n. 

In the second case v is within a constant distance di from a F-edge e. Consider the 
maximal F-band A containing e. We know that the length d2 of this band is at most 
n. The length of the boundary of this band is 0{n). The band A ends either on the 
contour of the diagram A or on the boundary of a transition cell. In the first of these 
cases d < di + 0{n) < 0{n). In the second case we can conclude that v is within distance 
0{n) from the boundary of a Q-band. As we know, the length of every maximal Q-band 
in A is at most n. Therefore the length of the boundary of any Q-band is < 0{n). Since 
every maximal Q'band in A ends and starts on the contour of A, we can conclude that 
d < 0{n). Therefore the diameter of A is 0{n). 

The lemma is proved. □ 

Later we shall need to analyze the process of reducing a non-reduced diagram without 
hubs over Vn{S). First we prove the following general results. 

Lemma 8.2 Let be a non-reduced diagram over any presentation. Consider a pro- 
cess of cancelling a sequence of reducible pairs of cells m \E'. // two edges e and f in 
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get identified in this process then there exists a path t in^! connecting the initial vertices 
i{e) and i{f) and such that Lab(t) = 1 in the free group. 

Proof. We shall use induction on the length of the cancellation process. If e = / 
in \1/ (that is if the length of the process is 0) then the statement is obvious. Suppose 
that e 7^ / in \1/ and e is identified with / after c cancellations. Let be the diagram 
obtained from \1/ after c — 1 cancellations. There exist a pair of cells vr and vr' in such 
that G ^(vr), / G d{7['), n and vr' form a reducible pair in and e and / are identified 
after we cancel tt and vr'. Since vr and vr' form a reducible pair in the boundaries of 
these cells have a common edge g. Since e and / are corresponding edges of vr and vr' the 
path which connects i{g) with t{e) on 8(71)^^ (oriented counterclockwise) and the path 
connecting z{g) with «(/) on d{7i') have equal labels. Let tti and vr^ be the "preimages" 
of the cells vr and vr' in Then these cells contain edges gi and g2 respectively such that 

• the path t connecting i{gi) with t{e) on d{TTi)^^ and the path t' connecting t{g2) 
with t{f) on 9(7r^) have equal labels and 

• the edges gi and g2 are identified (with the edge g) after c — 1 cancellations. 

By the induction hypothesis there exists a path p in \l/ connecting t{gi) with «(5'2) whose 
label is equal to 1 in the free group. Then the path t~^pt' in \E' connects z(e) with i{f) 
and its label equals 1 in the free group. The lemma is proved. □ 

Lemma 8.3 Let be a non-reduced diagram over any presentation. Consider a pro- 
cess of cancelling a sequence of reducible pairs of cells m // two cells vr and vr' in \E' 
cancel in this process then these cells correspond to mutually inverse relations and there 
exists a path p in connecting corresponding vertices in i9(7r) and d{Ti') whose label is 
equal to 1 in the free group. 

Proof. Indeed it is clear that vr and tt' correspond to mutually inverse relations. After 
a number of cancellations in an edge e on 9(7r) is identified with the corresponding edge 
/ on d{Ti')~^. It remains to apply Lemma ^.2| . □ 

Lemma 8.4 Let ^ be a non-reduced diagram over the presentation Vn{S) without n- 
annuli and without hubs. Suppose that each n-band in \I' is reduced. Consider a process 
of cancelling a sequence of reducible pairs of cells in . Then no K-annuli can appear as 
a result of this process and no two K-cells from the same n-band cancel. 

Proof. If we cancel two non-K cells then K-bands do not change. Let vr and vr' be 
K-cells which form a reducible pair in Let B and B' be the maximal /t-bands which 
contain vr and vr' respectively. Since vr and vr' are not hubs, they contain two K-edges 
each. Let e and / be the K-edges in vr and e' and /' are the K-edges in tt'. Notice 
that the start and end edges of B and B' belong to the contour of Finally assume 
without loss of generality that the median of B (resp. B') first crosses e and then / 
(resp. e' and /'). When we cancel tt and tt', the K-edges in vr are identified with the 
corresponding K-edges in vr'. By replacing B and B' with their inverses and renaming the 
edges e, /, e', /' if necessary, we can assume that e corresponds to e' and / corresponds to 
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/'. Let B = {iTi, . . . , Tii, TT, TTi+i, . . . , vTt), B' = (vtJ, . . . , TTj, vt', TTj+i, . . . , vTs). Then it is easy 
to check that after the cancellation we shall have two K-bands (tti, . . . , tTj, ttj, ttj-i, ■ ■ ■ , vti) 
and (vTi, . . . , TTj+i, TTj+i, . . . , TTs) (and their inverses) instead of B and B' (one or both of 
these bands may be empty). Both these K-bands start and end on the contour of \E'. All 
other maximal K-bands do not change. Therefore we do not get K-annuli by reducing tt 
and tt'. 

Now suppose that two cells tt and vr' of from the same K-band B eventually cancel 
as a result of our process of cancellation. Then by Lemma 8^ these cells correspond to 
mutually inverse relations and there exists a path t in \Ef connecting corresponding vertices 
V and v' of these cells whose label is 1 in the free group. It is easy to see that v and v' 
both belong to either top(i3) or to bot(i3). Therefore they are connected by a path t' 
which is contained in top(S) or bot(i3). Since t and t' connect the same vertices in \Ef, 
their labels must be the same modulo the presentation obtained from Vn{S) by removing 
the hub. Therefore Lab(t') = 1 modulo this presentation. But Lab(t') is a word over G. 
By Lemma |7.3| , this word must be equal to 1 in the free group. Therefore top(i3) contains 
two consecutive edges whose labels are mutually inverse. Then the corresponding cells 
form a reducible pair. This contradicts the assumption that every K-band in is reduced. 
The lemma is proved. □ 



9 Sectors 

By a sector we mean a reduced diagram A over V with boundary divided into four parts, 
d{A) = PiP2P3^P4^ , such that the following properties hold: 

• pi is the top path of an odd K-band /C; 

• p3 is the bottom path of an even K-band T; 

• p2 is the top path of a 6-band TZt, and the label Lab(j92) is a reduced word; 

• Lab(p4) = KfWKf where W is an admissible word for which there exists a com- 
putation connecting W with Wq (see Figure 5). 




Fig. 5. 
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Recall that a Kj-hand is called even (resp. odd) if j is even (resp. odd). 
If A is a sector then by removing the bands /C, T we get the inside diagram A' which 
has a boundary divided into 4 parts P\q^, Ptop' ^'right' ^bot ^^^^ ^^^^ ^'left — bot(/C), 

Ptop ^ P2, Pright ^ top(^)' Pbot ^ P*- 

In this section, we shall provide a complete description of sectors. 

We start with examples of sectors which correspond to computations of our machine 

S. 

As in the definition of the hub, for every word u let 

Notice that if m = Wq then K(u) is the boundary label of the hub. 

With every computation Wi, . . . , Wg we associate a van Kampen diagram over the 
presentation Vn{S) in the following way. 

If VFi, W2 is a one-step computation which uses a transition re© then there exists an 
annular diagram with boundary labels K{W]) and K{W2) of the form shown in Figure 
6 (in the case when N — 1): 



Wi 




Fig. 6. 

This 0-annulus is composed of cells corresponding to relations involving r. If we 
remove the 4A'" corners («;-cells) then the annulus breaks into AN equal diagrams. Each 
of them is obtained in the following way. Consider a path p labelled by Wi. Let r = 
\Ui ^ Vi,[/2 V2,...]. Since r is applicable to Wi, this word contains all UiS. Attach 
transition cells corresponding to the relations UJ = Vi to the subpaths of p labelled by Ui. 
Attach auxiliary r-cells to the remaining edges of p. Finally fold all pairs of edges with 



70 



the same label and the same initial vertex. It is easy to see that the resulting diagram is 
a B-band whose common edges are labelled by r, the top path is labelled by Wi and the 
bottom path is labelled by W2. 

Consider now an arbitrary computation C = {Wi, W2, ■ ■ ■ , Wg) with a history word 
h. Suppose that CS{Wg, Wq) is not empty. With every i = 1, . . . , g — 1 we associate 
the annular diagram with boundaries K{Wi) and K{Wi^i) as in Fig. 6. Since for every 
i = 1, . . . , g — 1 the inner contour of the i-th annular diagram is labelled by the same word 
as the outer contour of the {i + l)-st diagram, we can "concatenate" all these diagrams 
by gluing the next diagram in the hole of the previous diagram. As a result we obtain an 
annular diagram with outer contour labelled by K{Wi) and the inner contour labelled by 
K{Wg). This diagram is called a computational annulus corresponding to the computation 
Wi, Wg. This annular diagram is a union of AN sectors and inverses of sectors which 
will be called computational sectors corresponding to the computation C. Notice that the 
history word h of the computation C is equal to the label of the top (and bottom) path 
of any maximal K-band starting on the outer contour of the annulus. 

If C is an accepting computation then the inner boundary of the annular diagram 
corresponding to C is the boundary of the hub. Thus we can glue in the hub and obtain an 
(ordinary) van Kampen diagram with boundary label K{Wi). We call it the computational 
disk corresponding to the computation C. 

It is easy to observe that the area of the computational sector corresponding to the 
computation C is 0(area(C)) where area(C) is the area of the computation C. The area 
of a computational disc corresponding to C is thus also equal to 0(area(C)). We shall 
give an "abstract" definition of computational discs in the next section. 

In this section we shall prove that every sector can be transformed into a computational 
sector without changing the boundary label or increasing the area. 

Consider an arbitrary sector A with the inside diagram A', with boundary 

^(^')=PleftPtopPrightPbot' 

with an odd K-band /C and an even /t-band T. We assume that A has the smallest area 
among all sectors with the same boundary label. 

Notice that pjgf^ and P^-jgi^t consist of 6-edges. The maximal 9-bands starting on 
^'left ^^^^ ^^"^ '^'^ ^'right since A has no (A;, G)-annuli. For the same reason, maximal B- 
bands starting on Pf^^i^ must end on Pjgf^- Therefore the maximal 0-bands establish a 
one-to-one correspondence between edges on P\q{\^ and edges on Pfi^i^- Since all 9-edges 
in any 9-band have the same label, the labels of pjgf^ and Pfi^i^ are the same. Let us 
denote the maximal 9-bands in A starting on pjgf^ by 7li,...,7l£ counting from bottom, 
Pbot' *o Ptop- 

For every 1 < i < i the path bot(7^j)top(7^j_i)~^ bounds a subdiagram of A. The 
label of this path does not contain 9-edges. Therefore by Lemma |7.7| this subdiagram 
contains no cells. Hence bot(7?.j) = top(7?.j_i). 

By the definition of a sector, the label of the bottom path of TZi, that is the label of 
Pj^Q^, is an admissible word W such that CS{W, Wq) is not empty. Therefore the label of 
bot(7?.i) is a reduced word. The label of top(7^£) is a reduced word by the definition of 
a sector. We can also assume that the labels of hot(7li), i = I, are reduced words. 
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Indeed, if the label of the path bot(7^j) is not reduced then it contains a subpath 6162 
where ei, 62 are edges labelled by a and respectively, where a is a letter. These two 
edges must belong to two different cells tti and 112 of TZi as in the diagram on the left in 
the Figure 7. 





Figure 7. 

Then we fold ei and 62, producing a new edge e labelled by a and we introduce a new 
edge / with label a which has a common end vertex with e so that cells tti and tt2 have 
a common edge e. The other cells which were attached to ei and 62 will be attached to 
the edge /. This operation does not change the area or the boundary of the diagram A. 
The new diagram A must be reduced, otherwise after reducing it, we shall get a sector 
with a smaller area and the same boundary label as A (but we assumed that A has the 
smallest possible area). Thus the diagram A is again a sector with the same number of 
maximal B-bands starting on p\Qf^- The bottom path of the B-band TZi in A is shorter 
(by two edges) than the bottom path of T^j in A. The bottom paths of the other B-bands 
are not affected by this operation. Thus after a finite number of such operation we get a 
sector with the same boundary label as A, in which all bands TZi have top and bottom 
paths with reduced labels. 

The contour of A' does not have K-letters. Since A does not have K-annuli (this 
diagram is reduced). A' does not have K-edges. In particular, it has no hubs. 

For every component Qi of the vector of sets of state letters Q there exists exactly one 
Qj-edge on Pj^Q^ (here we use the form of the word Wo)- Therefore A contains exactly 
one maximal Qi-band starting on Pj^Q^ for every i. These Qi-bands do not intersect and 
cannot end on P]^q^, Pleft ^'right (^left ^'right contain no Q-letters). They cannot 
end on the contour of a hub because A has no hubs. Therefore these Qi-bands end on 
p^Qp. Since A contains no (Qj, B)-annuli (Lemma [7.4| ), each Qj-band starting on p\yQ^ 
intersects each B-band TZj {j = 1, exactly once. Also this implies that there are no 
Qj-hands which start and end on p^op- Therefore for every component Qi of the vector 
Q, A has exactly two mutually inverse maximal Qj-bands. These Qj-bands connect Ptop 
and Pi3Q^ and intersect each of the B-bands TZj exactly once (j = !,...,£). 

Therefore each of the B-bands TZj contains exactly one Qj-cell for every component 

Q^■ 
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Consider one of the B-bands TZ = TZj, j = 1, Let W be the label of the path 
hotiJZj) and let r G 6 be the label of the start edge of this 9-band. Suppose that W is 
an admissible word. Since TZ starts on pjgf^, the start edge of TZ belongs to pjgft, so it is 
oriented toward Ptop- 

For convenience suppose that r G 0+ (the case when r^^ G 6+ is similar). Let 

r = [f/i ^ V,, U2 ^ V2, Um ^ Vm] 

be the form of the S-rule r. By the definition of S-rules, Ui starts and ends with state 
letters, and the state letters from different f/j's belong to different components of Q, so 
these state letters are different. 

Let q and q' be the first and the last state letters of Ui, q E Qa, q' & Qb for some 
numbers a and b, a < b. There exists only one (up to cyclic shifts and inverses) relation 
in Vn{<S) containing a letter from Qa and r. This is the relation UIV^''^ = 1. The same 
relation is the only relation containing a letter from Qb- We have proved that the Qa-band 
and the Q^-band starting on P]^qi intersect with TZ. Therefore their medians intersect with 
bot(7^). Thus bot(7^) contains a Q^-edge e and a Q^-edge e'. These edges are oriented 
from to Pj-[g\it^ that is both of these edges belong to the path p-^^^iJZ). The labels of 
e and e' are q and q', respectively because these are the only Qa- and Q^-letters appearing 
in relations of Vn{S) together with r. Let vr and vr' be cells in TZ containing e and e'. 
The boundary of tt must have the form e~^tw'{t')^^ f^^w^^ where Lab(t) = Lab(t') = r, 
Lah{w') = Vi, Lab(/) = q', Lab(ew/) = Ui. If / is not equal to e' then / must belong 
to the contour of the a cell vr" of TZ which goes after vr. Then /, e', the second Q^-edge 
of 71 and the second Q^-edge of vr" belong to the same Q^-band (as we know there are 
only two mutually inverse Qf,-bands in A). Therefore this Qf,-band intersects TZ twice 
which is impossible by Lemma |7.4| . Therefore / = e'. Thus W contains subword Ui for 
every i = 1, ...,m, TZ has cells with boundary words rl^,r~^f/j~ for every ^ = 1, m, and 
the subpaths of these boundaries labelled by U~^ are subpaths of P^q^- Notice that the 
subpaths labelled by Vi are the top paths of these cells considered as 0-bands. 

If W has a Qj-edge e which does not belong to any of the Ui then the cells involving 
e must correspond to the commutativity transition relation q'^ = q where q = Lab(e). 

Removing from TZ all these transition cells which were discussed in the last two para- 
graphs, we obtain several G-bands without Q-edges on their contours. By Lemma [7^ 



these 0-bands must consist of auxiliary cells. It is clear that such a 6-band is uniquely 
determined by the label of its bottom path (which is equal to the label of its top path 
because the top and the bottom paths of an auxiliary cell considered as a 0-band, coin- 
cide). 

Summarizing the information that we have obtained, we can conclude that W has the 
form 

9lPl92P2--9cPc9c+l 

where gi are words without state letters, and either Pi = Uj or Pi G Qj such that none of 
the ?7's contain a Qj-lettei. The label of the top path of TZ is obtained by reducing the 
word 

9lRl92R2---9cRc9c+l 
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where Ri = Vi if Pi = Ui and Ri = Pi if Pi is none of the f/'s. Thus the label of top(7^) 
is obtained from W by applying the S'-rule r. 

Notice that if the start and end edges of TZ are labelled by where r G 0+ then the 
same argument shows that Lab(top(7^)) is obtained by applying the rule r^^ to W. 

We have proved that the sequence C of words 

Lab(p]^Q|.), Lab(top(7^i)), Lab(top(7^2)), Lab(top(7?.^)) 

is a computation of the ^-machine S. 

Notice also that our argument shows that there is a unique B-band with top path 
labelled by Lab (top (7?.)), the bottom path labelled by W and the 9-edges labelled by 
r. In order to create this band one needs to draw a path p labelled by W, glue in cells 
with boundary labels tVit~^Ui^ , tV2T~^U2^ , rV^r~^[/~^ to the parts of the path p 
labelled by Ui, Um respectively. Then attach auxiliary r-cells to the edges of p which 
are not in the parts of p which we used before. If after that we get two edges with the 
same labels and the same initial vertex, we fold them. After all these foldings we get a 
0-band with the top path labelled by Lab(top(7^) and the bottom path labelled by W. 

Therefore the sector A coincides with the computational sector corresponding to the 
computation C. Thus we have proved the following result. 

Proposition 9.1 Every sector can be transformed into a computational sector without 
increasing the area or the boundary label. The area of a computational sector is "big O" 
of the area of the corresponding computation 



10 Computational Discs 

Let A be a reduced diagram over Vn{<S) with reduced boundary label and exactly one 
hub. Suppose further that A does not have B-edges on the contour. In this case we call 
A a disc. The goal of this section is to provide a description of discs. 

Lemma 10.1 If A is a disc then every maximal K-band in A contains the hub, and 
every maximal Q-band starts or ends on the contour of the hub. 

Proof. Suppose that there exists a maximal K-band /C which does not contain the hub. 
By Lemma [TTI , /C starts and ends on the contour of A. Cutting A along the path top(/C) 
we obtain two diagrams Ai and A2. Since A has only one hub, one of these diagrams 
does not contain a hub. Without loss of generality assume that Ai is hub-free. The path 
top(/C) consists of 6-edges since /C does not contain hubs. These 6-edges cannot cancel 
when we form top(/C), otherwise /C would not be reduced. Consider the maximal 0-bands 
in Ai starting on top(/C). Since Ai does not contain hubs, these B-bands cannot end on 
top(/C) (Lemma |7.5| ). Therefore they end on 9(A). This contradicts the assumption that 
9(A) does not contain 9-edges. 

If A has a maximal Q-band Q which does not start or end on the contour of the 



hub then Q starts and ends on the contour of A (Lemma |7.2|) . Cutting A along top(Q) 



we obtain two diagrams Ai and A2, one of which, say Ai, does not contain hubs. The 



74 



©-edges do not cancel when we form the path top(Q) because otherwise two cells in Q 



form a reducible pair. Therefore the path top(Q) contains G-edges. By Lemma [7.4| the 
maximal 6-band starting on one of these edges must end on d{A). This again contradicts 
the assumption that d{A) does not have B-edges. The lemma is proved. □ 

Suppose that A is a disc. Since every cell in A (except the hub) contains a 6-edge, 
A is covered by G-bands (except for the hub). Since there are no G-edges on the contour 
of A, each maximal G-band in A is an annulus. Every G-annulus must contain a hub in 



its inside diagram (Lemma |7.6|) . Therefore every G-annulus in A goes around the hub, so 
the G-annuli in A form concentric annuli surrounding the hub. We shall consider only the 
G-annuli that go clockwise around the hub (other G-annuli are the inverses of these G- 
annuli). Let TZo be the outermost G-annulus and TZi be the innermost G-annulus. Every 
edge of the contour of A belongs to the contour of a cell. Indeed, otherwise by removing 
this edge one gets two disjoint diagrams one of which does not contain hubs. Since this 
hub-free diagram does not have G-edges on its contour, it must be empty (Lemma |7. 7| ) . 



Since the G-bands in A form concentric rings, all contour edges must belong to contours 
of cells from top(7^o). It is clear that bot(7?.j) fl bot(7^j)~^ is the contour of the hub 
(there cannot be any cells between bot(7^j) and the contour of the hub). 

Let us remove the hub tt from A and consider the maximal K-bands in the resulting an- 
nular diagram Ai starting on d{7i). Let us enumerate these bands clockwise by Si, B^n- 
Since A contains no maximal K-bands which do not contain the hub (Lemma |10.1| ), and 



since there are no K-annuli in A (Lemma [7.1|) , Ai is covered by AN subdiagrams Si 



T,4N such that Sj is bounded by top(i3j), a part of <9(A), bot(i3j+i) (addition modulo 4A^) 
and a part of 9(7r). By definition each of Sj is either a sector or the inverse of a sector, 
depending on whether the K-band Bi is even or odd; TZo H Sj is the top G-band in this 
sector. Using Proposition we can transform each sector ^ into a computational 



sector of the same or smaller area and with the same boundary. Thus we can assume that 
each Sj is a computational sector. 

The label of p-^Q^^ in each of these sectors is Wq. Let w be the history word of the sector 
El. This word is equal to Lab(top(Si)). By the definition of computational sectors, 

Lab(top(i3i)) = Lab(bot(i3i+i) = Lab(top(i3i+i)) 

for every i = 1, ...,4:N. Therefore the history words of all these sectors are the same. By 
the definition of a computational sector, computational sectors with the same label of 
the bottom paths and the same history words are homotopic. This implies that if we let 
Lab(top(7^o n El)) = u, then 

Lab(a(A)) = K{u). 

It is clear that the area of the disc is "big O" of the area of the corresponding com- 
putation. Let us compute the diameter of the disc. Take any vertex v inside the disc. 
It belongs to one of the G-annuli of the disc. Let is be the G-band number i counted 
from the hub to the boundary of the disc. It is clear that this vertex is within a constant 
distance (less than the maximal length of a relator of Gn{S)) from the G-band i — 1. 
Therefore v is within distance Ci from the boundary of the disc. Since the number of 
G-annuli in the disc is equal to the length of the corresponding computation, the diameter 
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of the disc is bounded above by a constant multiple of this length. On the other hand 
pick a vertex on the hub and consider any path p connecting this vertex with a vertex 
on the boundary of the disc. By Jordan's lemma, this path must cross medians of all 
0-annuli in the disc. Therefore the length of p cannot be smaller than the length of the 
computation corresponding to the disc. 
This proves the following statement. 

Proposition 10.1 Every disc can be transformed without changing the boundary and 
without increasing the area into a computational disc. The area of this disc is "big O" 
of the area of the corresponding computation. The diameter of the disc is "big O" of the 
time of the corresponding computation. 



11 The Upper Bound 

In this section, we shall prove that for every word w which is equal to 1 in Gn{S) there 
exists a van Kampen diagram A over Vn{S) with w = Lab(9(A)) such that A can be 
decomposed in some standard way into diagrams with at most one hub (like a snowman 
can be decomposed into small snow balls). This decomposition will then help us find an 
upper bound for the area and diameter of A, as functions of \w\. We assume that the 
function T^(|w|) (the area function of the machine S = S{M) is superadditive. 

Consider the following graph G{A) associated with the diagram A. The vertices of 
A are the hubs and the edges are maximal hub-free K-bands connecting the hubs. Since 
K-bands cannot intersect except in the hubs, G{A) is a plane graph (map). Every internal 
vertex of this graph has degree AN. We use the theory of plane graphs (maps) similar to 
small cancellation theory (see [^). In order to do that we need the following fact. 



Lemma 11.1 If A has the smallest possible number of hubs among all reduced dia- 
grams with the same boundary label, then G{A) does not contain polygons with only one 
vertex and does not have bigons (polygons with only two vertices). 

Proof. The first part of the statement follows immediately from the fact that A does 
not have K-annuli. 

Suppose that A has a bigon with two vertices tti and tt2 and two edges Bi and 82- We 
can assume that there are no K-bands inside the bigon between Bi and B2, so (considered 
as edges of G{A)), Bi and B2 are consecutive edges of tti. 

This implies that one of the bands Bi or B2 is a Kj-band and another one is a Kj+i- 
band. Let p2 (resp. P4) be the shortest subpath of d{7ii) (resp. i9(7r2)) containing the 
start (resp. the end) edges of Bi and B2. Then Lab(p2) = Lab(p4)^^ = (KjWoHj+i)^^. 
Without loss of generality we can assume that Lab(p2) = Lab^p^)'^ = HjWoHj^i. By 
renaming Bi and B2 and by replacing A by A~^ if necessary we can also make the path 
top(i3i)p2bot(i32)~V4 ^ bound a subdiagram S containing the bands Bi and B2. Every 



9-band which crosses Bi must cross B2 (Lemma |7l5|) . Therefore p2 is the top path of a 
0-band in S (the one which is the closest to tti). Thus S is a sector. By Proposition |9.1| , 
we can assume that S is a computational sector. Let d{iT2) = PaP^. 
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Let w be the history word for E. Let D be the computational disc with a hub vr 
corresponding to the same history word. Then D contains a copy Si of S. The boundary 
of the diagram Di = Z)\(Si U it) has the form S1S2 where 

Lab(s2) = Lab(top(i3i)p5bot(i32)"^). 

Thus we can cut A along the path top{Bi)p5hot{B2)^^ and insert the diagram 

in the hole (see Fig. 8). 




Fig. 8. 

The resulting diagram will have the same boundary label as A. The maximal disc with 
the hub 712 in this diagram corresponds to the history word w. Thus the boundary label of 
this disc will be K{Wq), the same as the boundary label of the hub 7r2. Therefore this disc 
without the hub is an annulus whose inner and outer boundaries have the same labels. 
Thus we can remove this annulus and obtain a diagram A' with the same boundary label 
and the same number of hubs as A (see the proof of Lemma |7. 1|) . But in A', the hubs 
TTi and 7r2 have a common path labelled by KjWoKs- By Lemma ^]T| (d), these two cells 
cancel. By cancelling these hubs we obtain a diagram with the same boundary label and 
a smaller number of hubs. This contradicts the assumption that A contains the minimal 
possible number of hubs. The lemma is proved. □ 

Lemma 11.2 G{A) does not have triangular faces. 
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Proof. Indeed, suppose that there are three hubs vri, 712, tt^ connected by three bands 
Bi, 82,83 such that B2 and B3 start on the contour of vti and end on the boundaries of 
TTs and 7r2 respectively, Bi connects boundaries of 112 and tts, none of the bands Bi, B2, B3 
contains hubs, and there are no hubs in the triangle bounded by these bands. Since B2 
and B3 are two consecutive edges of tti in G{A), the K-edges in one of these bands have 
even index and the K-edges in the other band have odd index. Let, say, B2 be a Kj-band 
with even i and B3 be a Kj-band with odd j. The band Bi connects tt2 and 773. Since 
Bi and B3 are consecutive edges of H2, the index of K-edges in Bi must be a even. But a 
similar argument applied to the hub ir^ gives that this index must be odd, a contradiction. 
□ 



Lemmas |11.1| and |11.2| shows that all faces in the planar graph G{A) have degrees at 
least 4. 

The following two lemmas are probably well known but we were not able to find them 
in the literature. 

Lemma 11.3 Let G be a non-empty plane graph in which every face has degree at least 
4- Then G either consists of one vertex or has two vertices of degree at most 3. 

Proof. Suppose that G contains more than one vertex. We shall use the notation 
from ||2^ (Chapter 5). Let V, E, F be the numbers of vertices, edges and faces in G. Let Q 
be the number of connected components in G. E refers to the summation over all vertices 
V or faces D in G, E' denotes the number of all boundary edges of G (the length of the 
boundary of G). By d{v) and d{D) we denote the degree of a vertex f or a face D. Then 
by Theorem 3.1 in Chapter 5 of the following formula holds (we take p = q = 4:, h = 
in the formula in this theorem): 

4Q = i:[A-d{v)] + J:[4-d{D)]- E-. (29) 

But by assumption, d{D) > 4 for every face D. Therefore 4Q < S[4 — d{v)]. The left 
hand side is at least 4. Hence either there exists a vertex with d{v) = 0, or there are at 
least two vertices of degree at most 3. In the first case the vertex of degree forms a 
connected component of the graph. Since G has more than one vertex, Q > 2. Then the 
left hand side of ( ^91) is at least 8. This implies that the right hand side contains at least 



two positive terms, that is G contains two vertices of degree at most 3. □ 

Now we are in a position to define the snowman decomposition of A. For the rest of 
this section we fix a large enough number A^. Later we'll show that we can take > 6 
where k is the number of tapes of the original Turing machine. Recall that the number 
of sectors in every computational disc is AN. 

With every diagram A we associate two numbers: 

• n{A) = \d{A)\ — the length of the boundary of A; 

• h{A) — the number of hubs in A. 
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Let /C be a K-band in A. Let A', \1/ be the subdiagrams obtained by dividing A along 
top(/C) such that \1/ contains /C. If \I/ does not contain hubs then we call /C a dividing 
K-band. In particular a dividing /t-band is hub-free. 

Suppose that A contains a dividing K-band /C. Clearly both A' and are reduced, 
h{A) = h{A'). Suppose that d{^) = top(/C)pi, d{A') = p2top{IC)-\ Since /C does 
not contain hubs, the word Lab(top(/C)) consists of 0-letters. The maximal 0-bands in 



\1/ which start on top(/C) cannot end on top(/C) because of Lemma |T^. Therefore all 
these 9-bands end on pi (see Fig. 9). Since pi contains at least 2 K-edges, we have 
bil > |top(/C)| +2. This implies that \d{^)\ < 2\d{A)\ and \d{A')\ < \d{A)\ -2. 
Thus we have proved the following properties of the pair A'). 

(Dl) \& and A' are reduced. 

(D2) By gluing \1/ and A' we obtain a diagram with the same boundary label as A. 
(D3) does not contain hubs. 

(D4) A' contains a smaller number of dividing K-bands than A. 
(D5) n(^) < 2n(A), n{A') < n{A) - 2, h{A') = h{A). 



K 



K 




w{Q) 



Figure 9. 



In Figure 9 w{Q) denotes any group word over the alphabet B. 

Since \1/ has no hubs, its area can be computed with the help of Lemma Thus in 
order to estimate the area of A we can replace A by A'. We shall call the pair A') 
the decomposition of type 1 of A. 



Suppose that A contains no dividing K-bands. Then by Lemma |11.3| there exists a 
vertex vr in G{A) of degree at most 3. Let Bi, . . . jB^n be the maximal /t-bands starting 
on the boundary of vr, counted clockwise. Since the degree of vr is at most 3, at least 
AN — 3 of these bands do not contain hubs (so they are not edges in G{A)). 

Lemma 11.4 There exist at least two hubs vr and tt' in G{A) of degree at most 3 in 
G{A) such that AN — 3 consecutive n-bands starting on (9(7r) (resp. diji')) do not contain 
hubs. 
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Proof. Induction on the number of hubs. By Lemma |11.3| , G{A) has at least two 
vertices vr and n' of degree at most 3. Suppose that vr does not have 4iV — 3 consecutive 
K-bands without hubs. Then there exist three maximal /t-bands Bi, Bj, Bm, i < j — ^, 
j < m — 1 starting on d^ii) such that Bi, Bj and Bm do not have hubs, but at least one 
band among . . . , Bj-i and at least one band among Bj+i, . . . , Bm-i contains a hub. 

Consider the subdiagram \E' of A containing Bi, . . . , Bj^i and vr, and bounded by 
top{Bi), bot(i3j_i), a part of d{7c) and a part of i9(A). The subdiagram \E' contains fewer 
hubs than A because among the /t-bands Bj^i, . . . , Bm-i there is at least one hub, which 
is not contained in By the induction hypothesis, \1/ contains two hubs tti and 7r2 of 
degree at most 3 such that there are AN — 3 consecutive maximal /t-bands starting on tti 
(resp. 7^2) which do not contain hubs. One of these hubs, say, tti differs from tt. It is 
easy to see that tti considered as a vertex in G{A) also has degree at most 3 and AN — 3 
consecutive K-bands without hubs. 

Now consider the subdiagram of A containing Bj+i, . . . , Bm-i and vr bounded by 
top{Bj), hot{Bm), a part of ^(vr) and a part of 9(A). The same argument as in the case 
of the subdiagram shows that \I'' contains a hub 7c[ of degree at most 3 which has 4N-3 
consecutive fc-bands without hubs. Thus we found two hubs tci and tt[ which satisfy the 
conditions of the lemma. □ 



Lemma 11.5 For every hub tt in A with hub-free consecutive maximal n-bands Bi,..., 
Bans starting ond^n), lef^^^n) be the subdiagram of A bounded by top{Bi), hot^B^N^s) , 
d{A) and d{7i), which contains Bi, B^n-s and does not contain tt (there is only one sub- 
diagram in A satisfying these conditions). Then there exists a hub tt such that \E'A(7r) 
does not contain hubs (see Fig. 10). 



Fig. 10. 

Proof. Induction on the number of hubs in A. By Lemma |11.4|, there exists a hub tt 



in A with hub-free consecutive maximal K-bands Bi, . . . , B4N-3 starting on d{Tr). Suppose 
that A' = \I/A(7r) contains hubs. Notice that A' contains fewer hubs than A since A' does 
not contain tt. By the induction hypothesis A' contains a hub tt' such that \1/A'(7r') does 
not contain hubs. Notice that none of the K-bands in A starting on the contour of tt' 
can intersect any of the K-bands Bi,. . . B^n^^ because these K-bands do not contain hubs. 
Therefore \1/a'(^') — ^a(71"'); so vr' is a hub for which ^I/aItt') is hub-free. The lemma is 
proved. □ 



By Lemma |11.5| we can find a hub tt and AN — 3 consecutive K-bands starting in d{TT) 
which do not have hubs and moreover \1/A(7r) is hub-free. We can assume that these 
K-bands have indices 1, 2A^, 1, 2A^ — 3 (otherwise we can rename k's). We denote 
these AN — 3 K-bands hj B[, . . . , B'^^_^ (as usual we enumerate the K-bands clockwise). 
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Let n be the maximal disc in A with the hub tt. By Proposition |10.1| , we can assume 
that n is a computational disc. We can also assume that 11 corresponds to a minimal 
(with respect to area) accepting computation. By removing the interior of 11 from A we 
get an annular diagram Denote the parts of B'^ [i = 1, ...,4A^) in \E' by Bi. Cutting \1/ 
along top(;Bi) and hot{B4N~3) we obtain two ordinary van Kampen diagrams and \I^2 
such that Bi, B2, ■ ■ ■ , B4^n-3 C ^1, B4^n-2, B4n~i, B^n C \1'2. Notice that v&i = \E'A(7r) -H. 

We shall fix the following notation associated with the triple \l/2, H) (see Figures 
11 and 12): 

• 9(^i) =top(Si)pibot(S4JV_3)-'si; 

• 9(^2) =t0p(i3i)-ls2-lb0t(i34^_3)p2; 

. d{U) = sih^; 

• Pi = e(i3i)tie(i32)t2 • • • ^4Af-4e(i347v-3) where e{Bi) is the end edge of Bi, tj is a 
subpath of j9i, z = 1, . . . ,4A^ - 3, j = 1, . . . ,4A^ - 4; 

s^^ = i{Bi)yii{B2)y2 ■ . .ym-^iiBm-s), 
S2 = yAN-3i{BAN-2)yAN-2i{BAN-i)ym~ii{Bm)yAN 

where i{Bj) is the start edge of Bj, for j = 1, 4A^ — 4, yj is a subpath of s^f^, for 
j = AN — 3, 4A^, yi is a subpath of S2; 

• Sj {i = 1, . . . ,4:N — 4) is a subdiagram of \&i bounded by top(Sj) on the left, 
hot{Bi^i) on the right, pi on the top and sj"^ on the bottom; 

• if \&i contains a G-band TZ which crosses Bi and and such that the intersection 
with Bi precedes in TZ the intersection with Bi+i then t{TZ, i) is the portion of 
top(7^) which is contained in S^; t(Jl,i) = f{7l,i)y(7l,i)f(7l,i + 1) where f(Jl,i) 
and f(Jl,i + 1) are K-edges; 

• b = |top(Si)| + |top(i32)| + . . . + |top(i347V-3)|; 

• c = \yi\. 



Fig. 11. 



Fig. 12. 
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When we need to specify the triple T = (^i, ^2, H) to which this notation is associated, 
we shall write Bi{T) instead of Bi, etc. 

The triple (^1, ^2, n) satisfies the following properties. 

(PI) By gluing together ^^i, ^2 and 11 one can get a diagram with the same boundary 
label as A. 

(P2) ^'i is reduced and contains no hubs. 11 is a computational disc corresponding to a 
minimal area accepting computation. 

(P3) The K-bands B^, i = 1, . . . , iN — 3, in start on sj"^, and end on pi. Every /t-band 
in starts or ends on si. 

(P4) The word Lab(/i|/i . . . fmUm) is a cyclic shift of the word K{W) for W = Lab(yi). 
For alH = 1, . . .,47V, c = \yi\. 

(P5) /i(*2) + l = MA). 

We assume from now on that we have a triple of diagrams (^1, ^^2, H) which satisfies 
the properties (P1)-(P5). We shall transform this triple into another triple which satisfies 
the same properties plus some additional conditions. 

Suppose that there exists a 0-band in ^1 which crosses the bands Bi, . . . , B^n-s- 
Then let TZ be the 0-band with this property whose intersection with Bi is as far away 
from si as possible (the distance is counted along Bi). We assume that 71 starts on 
top(Bi). Consider the diagram E bounded by top(Bi), top(7?.), bot(B4Ar_3) and si. 
Then E' — Eog^^g^ uf^i~^S^ is a union of sectors and inverses of sectors which correspond 
to the same history word w'. Let us remove this subdiagram from and consider the 
resulting diagram as a new ^1. Then extend the disc IT by an annulus with the history 
word w' and replace this disc by a computational disc with the same boundary label which 
corresponds to a computation of S with the smallest possible area. Let it be the new U. 
After that, glue ^'2 to the union of the four diagrams (sectors and inverse sectors with the 
history word w') T.^'^, i = 4:N — 3, AN — 2, AN — 1, AN, then reduce the resulting diagram. 
Let this diagram be the new \E'2 (see Fig. 13). The new triple (^1,^2)11) obviously 
satisfies properties (P1)-(P5). We shall call this operation moving Q-bands. In addition, 
this triple satisfies the following property. 

(P6) There are no G-bands in ^1 which cross all the bands Bi, . . . , B^n-z- 



Fig. 13. 
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Now suppose that we have a triple T = \l/2,n) which satisfies conditions (PI)- 
(P6). 

Take any number i = 1, . . . ,4iV — 4. Suppose that the following property does not 
hold: 

(P7) There is no 9-band 71 in Ej such that \y(7l,i)\ < \yi\. 

Then take a 0-band TZ such that |?/(7^, i)] (which is less than \yi\) is the smallest possible 
(over all i and all 9-bands TZ). 

Consider the subsector T of Sj bounded by top{Bi), t{7l,i), bot(;Bi+i), hiyihi+i (with 
TZ as the top 9-band). Let w' be the history word of this sector. We can assume by 
Proposition |9.1| that T is a computational sector. Consider the computational annulus E 
corresponding to the history word w' and initial configuration Lab(y(7^, i)). Then the disc 
n fits in the hole of this annulus. By gluing 11 inside E, and replacing this computational 
disc by a disc corresponding to a minimal area computation, we get a computational disc 
which we denote by Hi (see Fig. 14). 



Fig. 14. 

Let El be the union of AN — 4 consecutive subsectors of E such that the part p of 
the inner boundary of E contained in Ei has the same label as Si. Let E2 = E\Ei. Let 
us form a diagram \E'i 1 by the following procedure. First {step 1) we glue \l'i and E{^ 
along p = si. Notice that the /c-bands of E^^ will be glued to the K-bands of along 
the edges fi, so the /t-bands are getting longer. Some K-cells in \E'i will cancel with K-cell 
in Ei^. In step 2 we cancel these pairs of K-cells. Notice that if j = 1, . . . , AN — 3 and w' 
has a common suffix with Lab(top(i3j))) of length dj then dj cells of Bj will cancel. In 
particular, since w' is a suffix of Lab(top(i3j)), \w'\ cells in Bi cancel. Then in step 3 we 
reduce other reducible pairs of cells. The resulting (reduced) diagram is Finally let 
us glue \E'2 and E2^, reduce this diagram and denote the resulting diagram by \E'2,i. We 
call the process of obtaining the triple (^E'l,!, "^2,1, Hi) an adjustment. 

Lemma 11.6 The triple Ti = (\E'i^i, \E'2,i, Hi) satisfies the properties (P1)-(P5). For 
every i = 1, . . . ,4iV — 3, Lab(top(i3j(TL))) = (i/;')~^Lab(top(i3j(T))) (equality in the free 
group). 

Proof. Properties (PI) and (P2) are obvious. 

After step 1 of building \E'i 1 the K-bands of the resulting diagram will be just unions of 
corresponding pairs of K-bands in \E'i and -Ef^, so they will start on Si(Tl) C 9(-Ef ^) and 
end on pi{Ti) = pi{T). This property will still hold after step 2 because in this step we 
are cancelling cells from the same K-bands (notice that si(7i) and pi(7i) do not change 
after steps 2 and 3 because reducing a diagram does not affect its boundary). By Lemma 
8]1, no K-annuh are produced in step 3. So step 3 is just the process of cancelling reducible 
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pairs of cells. By Lemma |8.4|, no K-cells from the same fc-band cancel during this process. 



Cells from different K-bands Bj-^ and Bj^ do not form a reducible pair because these cells 
do not have common edges. Let be the diagram obtained after a number of reduction 
from step 3. Then for every j = 1, . . . , 4iV — 4 there exists an Eo-band which starts on 
si(7i) between Bj and Bj+i and ends on piiTi) between the same K-bands. Therefore no 
cell in Bj can have a common edge with a cell in Bj^i. Since does not have any other 
K-bands, no K-cells participate in the reduction process of step 3. Therefore the K-bands 
Bj, j = 1, . . . ,4N — 3, do not change during step 3. This proves property (P3) and the 
fact that in the free group, Lab(top(i3j(7^))) = (w')~^Lab(top(i3j(T))). 

Properties (P4) and (P5) immediately follow from the definition of the triple Ti. □ 

Since Ti satisfies properties (P1)-(P5) we can use all the notation associated with 
triples satisfying these properties. Notice that by construction 7j satisfies the property 
c(Ti) < c{T). The triple Ti may not satisfy property (P6). We shall discuss this property 
in the next lemma. Consider the following property: 

(P6') The words Lab(top(i3j)), j = 1, . . . , AN — 3, do not all have the same first letter. 

The following lemma shows, in particular, that (P6') is stronger than (P6). 

Lemma 11.7 a)IfTi does not satisfy property (P6') then w' is a prefix of the label of 
top{Bj(T)) for every j = 1, . . . , AN — 3. In this case 

h{r,) < b{T). 

b) If Ti satisfies property (P6') then T\ satisfies property (P6) (that is (P6') implies 
(P6)). 

c) If T satisfies (P6') then so does Ti. 



Proof, a) If w' is not a prefix of Lab(top(i3j(T))) for some j then by Lemma p.1.6 the 



first letter of Lab (top (7^))) is the first letter of {w')~^. On the other hand, by construc- 
tion, w' is a prefix of Lab(top(i3j(T))) (for the particular i considered above). Therefore 
the first letter in Lab(top(i3j(TL))) is the letter number \w'\ + 1 in Lab(top(i3j(T))). 
If these two first letters in Lab(top{Bj(Ti))), resp. Lab(top(i3j(7^))) coincide, then 
Lab(top(i3j(T))) contains a pair of consecutive mutually inverse letters; but this is impos- 
sible since Bi(T) is reduced. Therefore if the first letter of each word LaJo(top{Bj(Ti))) 
is the same, then w' is a prefix of Lab(top(;Bj(T))) for j = 1, . . . ,AN — 3. It is easy to 
see that in this case b(Ti) = b(T) — {AN — 3) * \w'\. 

b) If there exists a 9-band in \I'i(Tl) = \E'i,i which crosses all the K-bands Bi(Ti), 
2 = 1,..., AN — 3, then there exists a G-band containing the first cells of all these K-bands 
(this is the maximal B-band containing the first cell in Bi(Ti)), so the indices of the 
G-letters in the relations corresponding to these cells are the same. Therefore the first 
letters in the words Lab (top (^^(Tl))) are the same, a contradiction. 

c) If the the words Lab(top(i3j(T))), j = 1, . . . ,AN — 3, do not all have the same 
first letter, then w' cannot be a prefix of all these words, and we can apply part (a). The 
lemma is proved. □ 
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If Ti does not satisfy property (P6') then by Lemma [11.7] , h{Ti) < b(T). If Ti does 
not satisfy (P6) then we can apply the band moving construction and obtain a triple 7^ 
which satisfies (P1)-(P6). Notice that the band moving construction strictly decreases 
the parameter b. If T2 does not satisfy property (P7) then we can repeat the adjust- 
ment and band moving constructions and obtain a sequence of triples T2, T3, . . . such that 
b{Ti) > 6('7^) > . . .. For some s > the triple Tg satisfies condition (P6'). By Lemma 
|11.7| , Tg satisfies condition (P6) and the triples obtained from 7^ by any number of ad- 
justments satisfy (P6'); therefore we do not need further applications of the band moving 
construction to obtain 7^+i, 7^+2, • • •• 

As we noticed before, c(7i) < c(T). Since we do not use the band moving construction 
to obtain 7^+i,..., we have c(7^+i) > c(Z,+2) > ••• (notice that the band moving con- 
struction can increase the parameter c) . This sequence cannot be infinite so there exists t 
such that the triple % satisfies all seven properties (P1)-(P7). Without loss of generality 
we can assume now that T itself satisfies these properties. 

Now let T = \I'2, n) be a decomposition of A satisfying properties (P1)-(P7). We 
shall prove that the perimeters of \l/2, 11 relate nicely to the perimeter of A. 
From now on we shall assume that 

N>6. (30) 

Notice that because of Property (P6) every 9-band in starting on top(i3i) U 
bot(i34Ar_3) ends on pi. 

If a 0-band 71 of starting on an edge e & ti, i < AN — 4, ends on the contour of 
^AN-?, (resp. Bi) then we paint the edge e in red (resp. orange). We also shall call cells 
of these 6-bands red (resp. orange). 

Since B-bands cannot intersect, there could be at most one i between 1 and AN — 4 
such that ti contains both red and orange edges. Also it is clear that if ti contains a red 
(resp. orange) edge then tj+i (resp. tj_i) contains no orange (resp. red) edges. Therefore 
either ti, t2Af-3 contains no red edges or t2Ni t4Ar-4 contain no orange edges. Without 
loss of generality we assume that the second possibility holds. 

In this case let us erase the red paint from all red edges on the last 4 of the tjS 
[j = 4iv _ 7^4jv - 6,4A^ - 5,4A^ - 4). We shall save the last four of for the later 
construction, so we shall not touch the corresponding tj and yj by a paint brush. Notice 
that by our assumption these tj do not have orange edges. Therefore every B-band TZ in 

which crosses 84^^-7, niust have one of its ends on a tg with i < AN — 7. We paint 
this end of TZ in red if it is not red already. Then we call the maximal B-band starting 
on this edge red as well. 

Among the 2N — 7 numbers 2N, AN — 8 exactly — 4 > 2 numbers are odd. Notice 
that if i is odd then yi starts with a E(0)-edge. 

Pick one of the odd numbers i G {2A^ + 1, 2A^ + 3, 4A^ — H}- It will be clear later 
why we do not consider i = AN — 9. 

Let W be the word written on yi. We shall compare \\W'\\ and the length of ti. In 
order to do that we establish a correspondence between a^^-edges of yi and edges of ti. 
Then we shall prove that only a constant number of a^^-edges on yi can correspond to 
the same edge in ti. 
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Let us paint all colorless G-edges of ti in green. All cells of the maximal G-bands in 
Sj starting on these edges will be called green. Notice that since ti contains no orange 
edges, all G-edges of ti are either green or red. Therefore the G-band starting on a green 
edge of ti ends on pi. Since by definition of i/i no G-band in Sj which starts on Bi can 
end on Bi+i, every transition cell in S, is either red or green. 

We shall paint an a^^-edge of yi in green if the maximal F-band in Sj starting on this 
edge ends on a green transition cell. 

Lemma 11.8 No more than 4 Y -bands starting on a^^ -edges ofni can end on a green 
cell belonging to the same Q-band. 

Proof. Indeed, suppose that 5 F-bands 3^i,...,3^5 starting on a-edges 61,62,63,64,65 
end on the contour of the same G-band 71. We can assume that 61 precedes 62 precedes 
... 65 on Hi. Then first three of these edges are to the left of the X(0)-edge of i/i or the 
last three of these edges are to the right of this X(0)-edge. These two cases are similar 
so we shall consider only the first case. Let pi, p2, ps be the green cells on the contour 
of which the F-bands 3^i, 3^2, 3^3 end. These cells are transition cells corresponding to the 
rules of S involving a. Such transition cells must contain a X(0)-edge on the contour, so 
Pi, P2 and p3 belong to some maximal X(0)-bands Xi, X2, respectively ( see Fig. 15). 
Consider the subdiagram S' of Sj bounded by 3^i, 3^3, 7^ and The part of X2 which 
is contained in S' cannot cross TZ (Lemma |7.4|) . It also cannot cross and 3^3 because 
3^1 or 3^3 would contain a transition cells and F-bands cannot contain transition cells. 
Therefore A'o must start or end on i/i between ci^nd 62. 



7^ X2 




Oi a X C( X 



Figure 15. 

But this contradicts the fact that there are no X(0)-edges between 61 and 63. This 
contradiction proves our lemma. □ 

Notice that for every x G X(0) there exists at most one relation in Vn{<S) which does 
not contain E(0)- or F(0)-letters but contains a. 

This relation has one of the forms x'^{axa~^)~^ = 1 or x'^{a^^xa)^^ = 1. Such 
relations will be called xa-relations. A cell corresponding to an xa-relation will be called 
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an xa-cell. The contour of every xa-cell has two a-edges with opposite orientation. Let 
us mark these edges and consider a-hands consisting of auxihary a-cells and these xa-cells 
with common a-edges. Notice that every y-band starting on an a^^-edge is an a-band, 
but a-bands are in general longer, they can contain some transition cells (xa-cell). An 
a-band in Sj starting on iji can end either on ti or on yi or on a contour of a transition 
F(0)-cell. 

We shall need two general lemmas about a-bands. 

Lemma 11.9 // a Y-hand y in a reduced diagram starts on the contour of an x-cell vri 
and ends on the contour of an x-cell tt2 and these two cells belong to the same X(0)-6anc? 
B then B contains an E(0)-ce// or an F-cell between vti and tt2. 

Proof. We can assume that vti is the first cell in B and it2 is the last cell in this 
band. Suppose that B does not have a E(0)-cell and does not have a F(0)-cell between 



TTi and TT2. Then by Proposition |4.1| all X(0)-edges of B are the same. Therefore all these 
cells correspond to the same xa-relation x'^{a^^xa^^)~^ therefore the labels of top(i3) 
and bot(i3) have the form (r^a*)^ where a, 6 G { — 1, 1}. Therefore all a-edges on top(i3) 
(resp. bot(i3)) have the same orientation. Therefore our F-band y cannot start and end 
on top(i3) and cannot start and end on bot(i3). But it also cannot start (end) on top(i3) 
and end (start) on bot(i3) because otherwise the maximal X(0)-band containing B would 
intersect the F-band y and a F-band cannot have transition cells. □ 

Lemma 11.10 A reduced van Kampen diagram overVN{S) without hubs cannot con- 
tain a-annuli. 

Proof. Indeed, let A be an a-annulus. If A does not have X(0)-cells then it is a 



F-annulus which is ruled out by Lemma |7.9| . So A must contain an X(0)-cell. Thus it 



has at least 2 X(0)-edges on its inner contour. Therefore a X(0)-band B intersects the 
annulus A twice. Then B cuts A into two parts, Ai and A2- We can assume that Ai 
does not contain X(0)-cells. Then Ai is a F-band starting and ending on the contour 
of an X(0)-band B. By Lemma |11.9| this band must contain an E(0) or an F(0)-cell vr 



between the two intersections with A. Since A cannot have E(0)-cells or F(0)-cells, vr 
must be inside the subdiagram bounded by the median of A. But then the maximal E(0) 
or F(0)-band containing tt must intersect with A. This is a contradiction since A cannot 
contain E(0) and F(0)-cells. □. 

Now we can continue painting edges of Ui and t^. 

If an a-band starts on a green edge of yi and ends on y^^ then we paint the inverse of 
the end edge (it belongs to yi) green also. 

Lemma 11.11 The number of green edges on y^ cannot exceed 8 times the number of 
green edges on ti. 

Proof. Indeed, by the definition of green edges, at least half of the green edges e on 
yi have the property that the F-band 3^ starting on e ends on a green B-cell p. If e is 
such an edge then we associate with e the green edge on ti which is the start or the end 
edge of the maximal G-band containing p (whichever of these two edges belongs to t^; if 



87 



both start and end edges of this 9-band are on ti then we take the leftmost edge). By 



Lemma p.l.8| at most 4 green edges of yi are associated with the same green edge on tj. 
Therefore the total number of green edges on yi does not exceed 8 times the number of 
green edges of ti. □ 

Now we need more colors. 

If an a-band starting on yi ends on ti then we paint the start and the end edges of 
this band in yellow. The following lemma is obvious. 

Lemma 11.12 The number of yellow edges on yi coincides with the number of yellow 
edges on ti. 

Let us denote the maximal E(0)-band starting on the E(0)-edge of yi by S, the maximal 
F(0)-band starting on yi by JF and the maximal X(0)-band starting on yi by X. 

If an a-band A starting on a non-green and non-yellow edge e of yi ends on the contour 
of a red F(0)-cell such that the maximal F(0)-bands JF^^ containing this red cell are not 
JF^^ then both start and end edge of J-'e belong to tj (an F(0)-band cannot cross a K-band). 
We paint e and the start and end edges of JFg in blue. See Fig. 16. 



Lemma 11.13 No more than two maximal a-bands starting on blue edges of yi 
end on the contour of the same F{0)-band. 
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Proof. If there are three such maximal a-bands then at least two of them have start 
edges on the same side of the X(0)-edge of yi. Let Ci and 62 be blue edges of (t/i)^^ which 
are on the same side of the X(0)-edge of ef ^ preceding ef^ on yi, let Ai and A2 be 

the maximal a-bands starting on ei and 62 respectively. Then the edges ei and 62 either 
both belong to yi or both belong to {yi)~^ (if these edges were on opposite sides of the 
X(0)-edge of yi, one of them could belong to yi and the other one could belong to (?/j)~^). 

Suppose that Ai, i = 1, 2, ends on the contour of a red cell pi. Let TZi and 7I2 be the 
red G-bands containing pi and p2- Let JF' be one of the two mutually inverse maximal 
F(0)-bands containing pi and p2. 

Notice that the F(0)-band JF' cannot intersect TZi or 7^2 twice by Lemma [7.4| . There- 



fore the (red) start edges of TZi and 7^2 are between the start and the end edges of JF'. 
Notice also that between the a-edge and the F(0)-edge on the top or bottom of a F(0)- 
cell considered as a G-band, there always exists a X(0)-edge. This implies that if one of 
the maximal X(0)-bands containing pi or p2 is inside the subdiagram E' bounded by J-'' 
and ti then the a-band Ai must intersect JF which is impossible. Therefore the maximal 
X(0)-bands containing pi (resp. P2) must be outside S' (recall that an X(0)-band and an 
F(0)-band cannot cross). 

Since the G-bands TZi and 7^2 cross 8(84^^-7)^^, and since i < AN — 10, TZi and 7^.2 
cross both K-bands B4^n-9 and B^-s in S4iv_9. It is clear that 7^.2 is higher than TZi (that 
is the intersection of TZi Bi+i has bigger number on Bi+i than the intersection of 7^.2 and 
S.+i) . 

The paths topiJZi) and bot(7^j) start on ti and cross Bi+i. On the boundary of the 
cell pj, j = 1,2 we can read either the word 

(x(0, T„ 4)F(0, T„ 4))^(x(0, T„ a)F(0, r„ a)a^i)-i 
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or the inverse of this word. In the first case we shall call pj positive^ in the other case - 
negative. 

Since S' does not contain the X(0)-edge of pj, either both the edge labelled by 
F(0,rj,4), and the edge labelled by x(0,rj,4) belong to the path top(7?.j)~^ or both 
of these edges belong to the path bot(7^j)~^, so they are not oriented toward Bi+i. Simi- 
larly the edge of pj labelled by F(0, Tj, a), and the edge labelled by a;(0, r^, a) belong both 
to the path bot(7^j)^^ or both to the path top(7^j)^^. 

Let e'j be the a-edge of pj. Then is the end edge of the a-band Aj. In every cell of 
Aj two a-edges form an opposing pair. Therefore e'j and Cj form an opposing pair of edges 
on the boundary of Aj. Since ei and 62 either both point toward Sj+i or both point in 
the opposite direction, the edges e'^ and e'2 must also have the same orientation meaning 
that if we trace the boundary of pi and p2 clockwise then we either pass through e'^ and 
e'2 or we pass through (e'^^)"^ and (e'j)"^. 

This implies that either both pi and p2 are positive or both are negative. Therefore 
the common 9+-edges of pj with B^jy^g either both point toward ym-<i or both point in 
the opposite direction. 

The subdiagram V of S47V-9 bounded by y^N-g on the bottom, 7^2 on the top, B^n^q 
on the left and -B4Ar_8 of the right is a sector. By Proposition we can assume that 
r is a computational sector. Let C = Wi, ...,Wm be the corresponding computation 
connecting Wi = Lab(top(7?.2) and Lab(?/4Ar_9). Then Lab(bot(7^i)) is one of the words 
Wn. Since there exists a computation of S connecting Wm and Wq, there also exist 
computations connecting Wi and Wn with Wq. The rules used in the transitions Wi 
W2 and Wn-i — > Wn are -R4,a(ri)^^ and -R4,a(T"2)^^- The fact that we proved in the 
previous paragraph shows that either both of these rules are positive or both are negative. 
Therefore part 5 of Proposition 0| applies either to the computation Wi,...,Wn or to 
the inverse computation. In any case, in this computation, there exists a transition 
We Wi+i such that the rule r applied in this transition contains a word Ex as one of 
its left sides where E e E(0), x e X(0). Let TZ^ be the maximal 0-band corresponding 
to this transition in the computational sector F. Let TZ^ be the maximal 0-band in \E'i 
containing TZ^. 
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Figure 16. 



Let Fi be the subdiagram of bounded by Bi+i, TZi, 7^2, Let Xi be the maximal 
X(0)-band in Fi starting on the boundary of pi. Then A\ cannot have two common cells 
with TZi, it cannot cross Bi+i and it cannot cross J-"' (it can only touch J-"'). Therefore Xi 
crosses 7l2- Since 71^ is between TZi and 7^2, it must cross Xi. Let p be the intersection cell. 
Then p corresponds to the relation containing both an X(0)-letter and r. By definition of 
Vn{S) there is only one (modulo taking inverses and cyclic shifts) relation which contains 
both an X(0)-letter and r. This relation or its inverse is written on the boundary of the 
cell in TZ^ which contains an X(0)^^-edge. By the choice of r, this relation contains an 
(E(0))-letter. Therefore the boundary of p contains an E(0)-edge. Thus Fi contains an 
E(0)-edge. Let Si be the maximal E(0)-band in Sj containing this edge. The E(0)-band 
Si cannot cross JF' or Bi^i. By Lemma [^2] , Si is not an annulus. Therefore Si crosses 
TZi between pi and Since the E(0)-band Si cannot cross Ai, either the start or the 

end edge of Si must belong to yf^. This edge must be between ei and the (F(0))^^-edge 
of (i/i)^^. But (i/i)^^ does not contain (E(0))^^-edges between the a^^-edges and the 
(F(0))^^-edge. This contradiction completes the proof of our lemma. □. 

This lemma immediately implies the following result. 

Lemma 11.14 The number of blue edges on yi does not exceed the number of blue 
edges on ti. 

We continue painting the edges of yi. We shall paint an a^^-edge e of yi in pink if the 
a-band starting on this edge ends on the contour of a green G-cell. 

Lemma 11.15 There are no more than two a-bands starting on {yi)"^^ and ending on 
the boundary of the same green Q-band. 



90 



Proof. The proof is similar to the proof of Lemma |11.8| . Suppose that there are three 
such a-bands ^i, A2, A3 and the start edge of Ai precedes the start edge of A2 which 
precedes the start edge of ^3. Let pi, p2 and ps be the green cells containing the end 
edges of ^1, A2 and ^3 respectively and belonging to the same 0-band TZ. Then the 
maximal F(0)-band JF2 containing p2 cannot cross TZ again and cannot cross Ai and ^3. 
Thus J-'i must start or end on [i/i)^^- This contradicts the fact that yi cannot contain an 
(F(0))^^-edge between two a^^-edges. □ 

This lemma implies the following fact. 

Lemma 11.16 The number of pink edges on yi does not exceed twice the number of 
green edges on ti. 

Our next goal will be to prove that every a^^-edge on yi is either green, yellow, blue 
or pink. 

Let us suppose that it is not so. Then there exists a colorless edge e G (jji)^^ labelled 
by a such that the maximal a-band starting on e ends either on an edge e' of {yi)^^ or 
on the contour of a F(0)-cell which belongs to the F(0)-band JF and to a red 0-band. In 
the first case e' is colorless (it could be only green but then e would also be green and 
we assumed that e is colorless). In this case we paint both e and (e')~^ in brown. In the 
second case we paint e in black. 

Notice that if there are brown or black edges on yi then ti contains red edges. Therefore 
there exists a 6-band in \E'i which starts on ti and crosses B4N-7. 

Lemma 11.17 yi does not contain brown edges. 

Proof. Suppose that yi contains a brown edge e. Let e' be the end edge of the maximal 
a-band A in Sj starting on e. As we know, e' is also brown. Since e belongs to yi, e' 
belongs to y^^ and the labels of e and e' are the same. Therefore these edges cannot be 
on the same side of the X(0)-edge of yi (since the label of yi is a reduced word). Thus one 
of these edges is between the E(0)-edge and the X(0)-edge and the other one is between 
the X(0)-edge and the F(0)-edge. Suppose that e is between the E(0)-edge and the X(0)- 
edge. The case when e' is between the E(0)-edge and the X(0)-edge is completely similar 
(although not identical because we have the additional assumption that e E yi). 

Since e E yi and the label of e is a, the part of the word Lab(|/i) between the E(0)-letter 
and the F(0)-letter has the form a'^xa" where m > 0, n < 0. 

Since e is not a green or yellow edge, the F-band starting on e ends on a red 0-cell. 
Let TZ' be the maximal red G-band which contains this cell. Since e is to the left of the 
X(0)-band X, TV intersects X . 

Let p be the first cell in X (its contour has a common path with y^^). Let IZ be 
the maximal B-band in \I'i containing p. Since TZ cannot intersect TZ' , it must cross 
bot(i34Ar_7). So IZ' is a red 6-band ( see Fig. 17). 
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Figure 17. 



Let S' be the diagram bounded by IZ, B4N-9, ^m-s, y4N-9- Then S' is a sector which 
we can assume to be a computational sector. This sector has only two mutually inverse 
maximal B-bands (one of which is a part of TZ). Indeed, if there were a G-band TZ" in 
S' below TZ then the maximal B-band in \E'i containing TZ" would intersect A" and the 
intersection cell would be lower than p. Thus the computation corresponding to the sector 
S' consists of one transition Wi — > W2 where Wi = Lab{y4N-9)- Recall that the a-part of 
Lab(?/4jv-9) is equal to a'^xa"' where n 7^ and m ^ 0. Hence the only rules applicable 
to Wi are rules of the form x'^{a'^xa~'^)~^ . If e = — 1 then < a contradiction 

with property P7. Thus the boundary of the cell in E' which has an X(0)-edge has label 
Taxa^^T^^x~^. Denote this cell by p". Since the cells p and p" belong to the same 0-band 
and their X(0)-edges have the same orientation on si, on the boundary of p one can read 
the same word. Let Ai and A2 be the maximal a-bands in Ej which start on the two 
a-edges ei and 62 of the contour of p where ei belongs to the subdiagram bounded by £, 
X, Hi and and 62 belongs to the complement of this subdiagram in Ej. These a-bands 
cannot intersect the a-band A. Therefore they must end on yf^ (a-annuli are ruled out 



by Lemma |11.10|) . The a-band ai cannot end to the left of the X(0)-edge of yi because 
the start and the end edges of an a-band must form an opposing pair. Therefore the end 
edge of Ai is to the right of the X(0)-edge of Similarly the end edge of A2 must be to 
the left of this X(0)-edge. This implies that Ai and A2 must intersect, a contradiction. 

□ 

Lemma 11.18 yi does not contain black edges. 

Proof. Let e be a black a-edge on yf^. Let A be the maximal a-band starting on 
e, and let p be the cell from JF on whose boundary A ends. Let TZ be the maximal red 
9-band containing p ( see Fig. 18). 
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The subdiagram T bounded by B4N-9, Bans, and Van -9 is a sector and by Propo- 
sition |9.1| can be converted into a computational sector without changing the bound- 
ary or increasing the area. So we can assume that F is a computational sector. Let 
C = {Wi, ...,We) be the corresponding computation where Wi (resp. W2) is the label 
of the part of topiJZ) (resp. bot(7^)) between BiN-9 and We = Lah{y4N-9) = 

Lab(?/j) = W. The label 6 of the 0-edges of TZ is an S-rule which involves a and an 
F(0)-letter. Therefore this rule is R^^ai'^) for some r. Since there exists a computation 
of S connecting We and Wq (this is the computation corresponding to the computational 
disc n), there exists a reduced computation connecting Wq and Wi. By Proposition 
either Wi or W2 is a positive word and the degree of a in Wi and W2 is positive. 

Suppose that We contains a~^. Let Wm, m < £,he the last word in the computation C 
which does not contain a~^. Then the a-part of Wm+i has the form a°'xa^ where x G X(0), 
ab < 0. Therefore only xa-rules are applicable to Wm+i- Since the computation C is 
semiproper, all words in the computation Wm+i, ■■■,We contain a~^, so in this computation 
only xa- rules can apply and each of them inserts a new a~^. Since xa-rules do not affect 
non-a parts of admissible words, the length of We is greater than the length of W^+i 
provided m + 1 < i. This contradicts Property P7. Thus m + 1 = i. The only rules 
which can insert are -R4,a(T) and xa-rules. Thus one of these rules is applied in the 
transition Wm We. But if -R4,a(r) applies in this transition then the degree of a in 
Wm should be 0. Since Wm is normal and positive, it must contain no F-letters. Then 
|Wm+i| = \ Wm \ + 4 which contradicts Property P7. 

If an xa-rule is applied in the transition Wm — > Wm+i then = \Wm\ + 2 and 

we again get a contradiction. 

Thus the a-part of We does not contain a~^, so all a^'^^-edges of yiN-9 have label a. 
Since the labels of yi and yAN-9 are the same, all a^^-edges of have label a. 
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Figure 18. 



Since e has label a, it must belong to The start and end edges of every a-band 
form an opposing pair on the contour of this band. Therefore the end edge of A must 
belong to the inverse of the boundary of the cell p. Therefore on the boundary of p, one 
can read F~^x~^'~f~'^a'^x'F''-f where 7 G 0+, x,x' G X(0), F,F' G F(0). Therefore the 
same word can be read on the boundary of the intersection p' of TZ and the F(0)-band 
J-'' of r starting on i/iN-g- Hence the rule applied in the transition Wi — > W2 in the 
computation C is -R4,a(r)~^ for some r. Hence the length of W2 is greater than the length 
ofWi (Proposition ^A\). 

Let 7l[, ...,7l'^_i be the maximal 9-bands of F starting on the contour of B4N-9 counted 
from top to bottom (so ym-9 = bot(7^^_i) ) and let 7^i,7^2, ■■■y'JZe-i be the red 9-bands 
of containing 7l[, ...,7l'g_i (so 71 = TZi). Then W2 is written on the bottom of TZ'^. 
Let e'l be the last a-edge of hotiJZ'i). Let A'l be the maximal a-band in F starting on 
e'l- If A'l ends on ym-d then the degree of a in Wg is greater than the degree of a in Wi. 



Since Wi is a positive word (by Proposition |4.1| ), We is longer than Wi which contradicts 
Property P7. 

The a-band A'l cannot end on the contour of TZi because then Wi would contain 
a~^. Therefore the only possibility is that A'l ends on the contour of the F(0)-band JF'. 
Let the F(0)-cell p'l on the contour of which this band ends belong to the 9-band TZj^^. 
Considering the orientation of a-edges of bot(7?.i) and the orientation of p[ we conclude 
that the rule applied in the transition Wj^ ^ji+i the computation C is -R4,a(ri) for 
some Ti. Let pi be the intersection of 7?.j^ and JF. Then the maximal a-band Ai starting 
on the a-edge of the contour of pi cannot cross A and cannot end on yf^ (because all 



94 



a^^-edges of yi have label a). Therefore Ai must end on the contour of an F(0)-cell p2 
from JF. This cell must belong to a red 0-band TZj^ for some j2 between 1 and ji. Let 
p'2 be the intersection of TZj^ with JF'. Then the a-band ^2 starting on the a-edge of the 
contour of ends on the contour of some cell pg of (it cannot cross Al^), and the red 
B-band TZj^ containing this cell intersects JF at some cell p^. Continuing in this manner 
we shall construct a sequence of different cells pi,p2,P3, ••• of JF. This process obviously 
cannot stop which contradicts the fact that JF contains only finitely many cells (see Fig. 
18). This contradiction completes the proof of the lemma. □ 

Thus we have proved that every a^^ edge of Ui is either green or yellow or blue or 
pink. This allows us to prove the following statement. 

Let rii be the number of non-red edges of ti. Recall that W denotes the label of yi. 

Lemma 11.19 \\W\ \ = ||Lab(yj)|| < AOrii + 15k + 6 < Glkui (where k is the number 
of tapes of the machine M). 



Proof. Indeed, by Lemmas |11 . 1 1| , |11.12| , |11.14| , |11.16| , the number of green and blue 



edges on yi does not exceed 8 + 2 = 10 times the number of green edges of tj, the number of 
yellow edges of yi does not exceed the number of yellow edges of ti and the number of pink 
edges of yi does not exceed the number of pink edges on ti. Therefore the degree of a 
mW = Lab(?/j) does not exceed lOrij. Since W is a normal word, | \W\ \ = Ami + 15/c + 6. 
This implies the inequalities of the lemma. □ 

We shall also need to estimate the length of W . 

Lemma 11.20 \W\ < (30A; + 12)|pi| 

Proof. Indeed, every F-band in starting on y^ [i between 1 and 4N — 4) ends 
either on pi or on the contour of a maximal B-band in Since every maximal B-band 
either starts or ends on pi, it is enough to show that the number of maximal F-bands 
in ^1 starting on yi and ending on the contour of the same B-band does not exceed 
30 A; + 12. The proof is similar to the proof of Lemma |11.8|. Suppose that there exist 



30k + 13 y-bands starting on y^ and ending on the contour of the same B-band TZ. Since 
the number of Q-edges on yi is 15k + 6, there exist three F-bands 3^1, 3^2, 3^3 ending on 71 
and starting on edges ei, 62, 63 of yi such that ei precedes 62 which precedes 63, and there 
are no Q-edges between ci and 63. Then the maximal Q-band containing the intersection 
of 3^2 and TZ must end or start on yi between ci and 63 because it cannot cross TZ twice 
and cannot cross 3^i and 3^3. This contradicts the fact that there are no Q-edges between 
ei and 63 on sj~^. □ 

Now given our decomposition (\E'i, \1'2, 11) of A we define a new decomposition (F, E, 11) 
in the following way. Let E' be inverse of the union of subdiagrams Sj where j = 
4N -7,4:N - 6, 4A^ - 5, 4A^ - 4 with the K-bands B^n-t and B^n-s removed. Then 

9(s') = u^wu2{pr' 

where Ui = bot(i347v-7), p' C pi, U2 = top(i347v-3), q' C sj"^. Notice that Lab('U2) = 
Lab(bot(i34/vr_3)) and Lab(g') is equal to Lab(s^^) if we change indices of the labels of 
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K-edges. Let us make this change of indices in the whole S' and denote the resulting 
diagram by S. 

Then we can glue S and along bot(;B4Ar_3). The resulting diagram will be denoted 
by r. We also can glue and \E'2 along S2 ^bot(S4Ar_3). The resulting diagram will be 
denoted by E. 

Notice that 

d{T) = hot{Bi)piqus'si 
where |g| = Y.'j=4N-7 + 3 < \pi\, Lab(M) = Lab(bot(i34Ar_7))"^, Lab(s') = Lab(s^^); 

d{E) = hot{l3i)-\-\-^p2. 

It is clear that by gluing E, T and 11 we can get a diagram with the same boundary 
label as A. It is also clear that the number of hubs in E is h{A) — 1 and F does not have 
hubs. 

We shall call the triple (F, E, 11) a decomposition of type 2 of A. We can now estimate 
the perimeters of F, E, U. 

Lemma 11.21 Let the boundary label of the computational disc U be K{W) for some 
word W . Then there exist positive constants ei and €2 such that 

1. n(F) < ein(A) and 

2. n{E) + e2\\W\\ < n(A). 

Proof. We have that 

n(F) = |top(i3i)| + |si| + |s2| + bi| + |g| + 



n{E) = |top(i3i) + \u\ + \q\ + \p2\. 

The number |top(Si)| is equal to the number of orange edges on pi. 

The path u consists of 9-edges (recall that Lab(u) = Lab(top(;B4Ar_7))). No maximal 
0-band 7?. in F starting on u can end on tj where j > AN — 7: this follows from the fact 
that F - ^1 is symmetric to the union of S^, j = AN - 7, AN - Q, AN - 5, AN - A (with 
B4N-3 removed). Thus it ends either on tj for some j < AN — 7 or on q. In the first 
case the end of 7^ is a red edge. In the second case let e be the end edge of 71. Since q 
with K-letters removed is symmetric to the union of the tj's, j = AN — 7, ...,AN — 4, there 
exists a symmetric edge e' on one of these tj. Then the maximal G-band TZ' starting on 
e' must cross B4N-7, therefore it ends on a red edge of one of the tj's, j < AN — 7. The 
end edge of 71' is red. Thus we have a one-to-one correspondence between edges of u and 
some red edges on pi. Therefore the length of u does not exceed (in fact one can prove 
that it equals) the number of red edges of pi. 

The length of Sis^^ is 4iV|iy| -|- 4A^. Therefore by Lemma |11.20 



\si\ + \S2\ < AN{30k + I2)\pi\ +AN < AN{30k + 13)|pi 
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Thus 

^(r) < bil + 4:N{30k + 13)|j9i| + 2|j9i| < ein(A) 

for some ei. 

Notice that n(A) — n{E) = \pi\ — |top(i3i)| — 1^1 — |g|. Since the |top(i3i)| is equal 
to the number of orange edges on pi, \u\ does not exceed the number of red edges on 
Pi) ~ |top(i3i)| — \u\ exceeds the number of green, yellow, blue and pink edges of 
Pi plus the number of colorless edges. By construction |g| is equal to the sum of the 
lengths of the t/s, j = 4N - 7,4N - 6, AN - 5, AN - A plus 3 (the K-edges). Notice 
that the edges of these tj are colorless (because we removed the red paint from these 
tj and picked i < AN — 7 when we starting to paint in different colors). Therefore 
n(A) — n{E) = \pi\ — |top(i3i)| — \u\ — \q\ exceeds the sum of the rij's for all odd i from 
2A'" + 1 to AN — 11, where rii is the number of non-red edges of ti plus the number of 



K-edges between ti and t4iv_7 minus 3. By Lemma 11.19 each of these rii is greater than 



I |/(61/c). The number of K-edges between ti and t4Ar_7 is greater than 3, so 

n{A)-n{E) > | |/(61A;). 

This completes the proof of our lemma. □ 

The following proposition gives the upper bound for the Dehn function of the group 
GNiS). 

Proposition 11.1 If w = 1 in G]\i{S) then there exists a van Kampen diagram A{w) 
over Vn{,S) with boundary label w, area < Ci ■ T(c2 ■ 1^1)"^ and diameter < c[ ■ T(c2 ■ Iwl)'^, 
for some positive constants Ci , C2 and c[ , . 

Proof. By the van Kampen Lemma there exists a reduced diagram A over Vn{S) with 
boundary label w and minimal possible number of hubs. Then we can apply the snowman 
decomposition and obtain three sequences of diagrams Ei,...,Es, ri,...,rs_i, ni,...,n5_i, 
with the following properties: 

(51) E, = A. 

(52) El contains no hubs. 

(53) For every i = 1, s — 1, we have: either (Fj, Ei) is a decomposition of type 1 of Ei+i 
(and Hi is empty); or (Fj, Ei, Hi) is a decomposition of type 2 of Ei^i (in particular, 
Hi is a computational disc). 

This implies that we can glue together the diagrams Ei, Fi, Fs_i, Hi, H^.i so as 
to obtain a diagram with boundary label w. Therefore in order to find the upper bound 
for the area of A we need to estimate the sum of the areas of Ei, Fi, F^-i, Hi, n^-i. 



Property (D5) of decompositions of type 1, and part 2 of Lemma |11.21| show that for 
every i = 1, s — 1, 

n{Ei) < n{Ei+i). 

Therefore 

n{Ei) < \w\, i = 1, ...,s. 
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and s < \w\. 

The diagram Ei and for every i the diagram Fj contain no hubs. So by Lemma |Tl 

area(ri) < CiniTif 

for some constant Ci and 

area(Ei) < 2Cin{Eif. 
We also know that n{Ei) < \w\ and by Lemma |1L21| and property (D5), n(rj) < 



C2'n{Ei^i) < C2|w|, for z = 1, s — 1 and some constant C2. Therefore 

(31) 



area(i?i) + area(ri) + . . . + area(rs_i) < 
C,n{Eif + CmiT.f + CmiT^f + ... 

+Ci(r,_i)3 < 



for some constant C3. 

It remains to estimate the sum of the areas of discs Hj. We can assume that each 
Hi corresponds to a minimal area computation of S connecting some word Wi and Wq- 
Therefore the area of each llj does not exceed "big O" of the area of the corresponding 
computation. By Proposition the area of each Hj does not exceed C4T(| 1)"^ + 



C5?T,(nj) for some constants C4 and C5. 

area(ni) + area(n2) + . . . + area(ns-i) < 
C,{T{\\W,\\r + ... + T{\\W,.,m)+ 

C5(n(ni)3 + n(n2)3 + ... + n(n,_i)3) < 

C,T{\\W,\\ + ... + WWs-iWr + C,{n{U,f + ... + n{U,.,f). 

Here we used the superadditivity of the function T{nY. For every i from 1 to s — 1, 
if (Fj, Ei, Hi) is a decomposition of type 2 of then by Lemma |11.21| (2) we have 



\\W,\\<C^{n{E,^^)-n{Ei)) 
for some constant C^. Therefore 

111^111 + ... + \\Ws-i\\ < CQn{A) = Cq\w\. 
By Lemma |11.20| n{Ili) < C^n{EiJ^i) < Cs\w\ for some constant Cs- Therefore 
niUif + ... + niUs-if < Cg\w\ ■ \wf = Cg\w\\ 

Thus 

area(ni) + area(n2) + . . . + area(ns_i) < 
C,T{Ce\w\)' + C,o\w\\ 



(32) 



for some constant Ciq. 

Combining (pTf) and (|3^) we get 



area(A) < Csl^l^ + CiT{CQ\w\y + Cio\w\^ < CiiT{C_ 



'i2\w\Y. 
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for some constants Cn and Cu- This completes the area part of the proposition. 

Now let us estimate the diameter d{A). If A = Ei that is if A has no hubs then we can 
apply Lemma and conclude that d{A) < 0{\w\). If A contains hubs, we can use the 
snowman decomposition and obtain three sequences of diagrams Ei,..., Es, Fi, ...,rs_i, 
ni,...,ns_i as above. Let d{Ti), d{Ei), dijli) be the diameters of the correspomding 
diagrams. It is easy to see that 

d{A) < max,=i,...,,_i{rf(Ei) + E-Zl \d{Tj)\ + E |5(n.-i)|, 

rf(r,) + \diE,)\ + E-zl |5(r,)| + (33) 

\dm + \diE,)\ + Efj, |9(r,)i + E'-i m,)\}. 

Indeed, take a vertex inside the diagram A' obtained by gluing together Ei, all Fj's and 
all Ilj's. It belongs to one of these blocks. Take the shortest path to the boundary of this 
block, then we can get to the boundary of A' going along the boundaries of the blocks. 
The resulting path will have length bounded by the right part of (^). 

We already know the upper bounds for |i9(rj)| and |(9(nj)|. By Lemma |11.21 these 



do not exceed 0{n{A)). Since s < n(A), the sums EjZl \d{Tj)\ and \d(n.j)\ do not 
exceed 0(?t,(A)^). By Lemma the diameters of diagrams Ei and Fj [i = 1, s — 1) 
do not exceed "big O" of their perimeters. By Lemma |11.21| these perimeters are smaller 



than 0(n(A)). Thus the diameters of Ei and Fj do not exceed 0{n{A)). By Proposition 
|10.1| , the diameter of the disc Ilj does not exceed 0(T{\d(Ili)\Y). We also know that 
|f^(ni)| < 0(n(A)). Therefore the diameter of Ilj does not exceed C[T{C2\w\Y for some 
constants C[ and Combining this information with (^) we conclude that 

d{A) < C[T{C'2\w\f + C'^n^ 

for some constant C3. Since T(n) > 0{n), we deduce that 

d(A) < c;T(4ki)^ 

for some constants c[ and C2 as desired. 
The proposition is proved. 

□ 



12 The Lower Bound 

The goal of this section is to prove that the group Gn{<S) simulates the machine S, that 
the Dehn function of G'Ar(5) is bounded below by a function equivalent to r(n)^, and that 
the smallest isodiametric function is bounded below by a function equivalent to T(n)^. 
We start with the following lemma. 

Lemma 12.1 Fix an admissible word W for S. We call a diagram A over the pre- 
sentation Vn{<S) quasi-sector for W if the boundary of A can be represented in the form 
biSib2^S2^ where bi is the top path of a Ki-band Bi, 62 is the bottom path of a K2-band 
B2, and both bands start on $2; Lab(si) = k,iWk,2, Lab(s2) = hiWqk,2; all hubs of A are 
on Bi. 
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1. Suppose that A is a quasi- sector for W which either has the smallest possible area 
among all quasisectors for W or has the smallest length of Bi among all quasi-sectors for 
W . Then A is a sector. 

2. Suppose that A is a quasi-sector for W . Then the label of hot{B2) is the history 
word of a computation of S connecting W and Wq. 

Proof. Suppose that Ei does not have hubs. Then since Si and S2 do not have 0- 
edges, every maximal B-band which starts on hi ends on 62 and every maximal 0-band 
which starts on 62 ends on hi. This implies that si is the top path of a 0-band starting 
on hi and ending on 62- Thus by definition, A is a sector. 

Now suppose that Bi contains hubs. Let us number these hubs tti, 7r„ along Bi. Let 
TTj be one of these hubs. The path hi contains exactly one ^Tv+i-edge from diji). Let /Cj 
be the maximal K^v+i-band of A starting on this edge. Since A does not have hubs outside 
Si, each /Cj ends on the boundary of some hub tTj. Since Kjv+i-bands cannot intersect, it 
is easy to see that there must exist a number i from 1 to n — 1 such that /Cj ends on TTj+i. 



W B2 




Figure 19. 



Consider the subdiagram S of A bounded by Bi, /Cj, and the contours of the two 
hubs TTj and vTj+i such that S does not contain vr, and TTj+i (see Fig. 19). Let i9(S) = 
^^'2(^2)'^^ i^'i)^^ be the corresponding division of the boundary of S where h[ C 61, s'l C 
di^TCi), S2 C d{TTi+i)~^, h'2 C bot(/Ci). The diagram S does not contain hubs and has A^ + 1 
K-bands Ti, T/v+i, connecting edges on si with edges on S2 where Ti C Bi, T^+i = /Cj. 
An argument similar to that used in the proof of Lemma |11.1| shows that S is a union of 
sectors and inverses of sectors corresponding to the same computation C. By Proposition 
9TT| we can assume that these are computational sectors. As in the proof of Lemma |11.1| 
consider the computational disc LI corresponding to the computation C. We can identify 
S with a part of LI and vTj with the hub of LI. Let L = LI — S — tt^. Cut A along the 
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path {b[) ^s[b2 where Si ^si = d{ji), and glue in the resulting hole the diagrams V and 
r^^. Then 11 becomes a subdiagram of the resulting diagram A'. Then again as in the 



proof of Lemma we notice that the boundary label of 11 is the same as the boundary 
label of the hub, so we can replace the subdiagram 11 in A' by the hub vTj. After that 
TTj and TTj+i will form a cancellable pair of cells, so we can cancel them. Let A be the 
resulting diagram. Let Bi be the Ki-band in A connecting si and S2- Finally let A be 
the subdiagram of A bounded by Bi, Si, S2, B2. It is easy to see that the area of A is 
smaller than the area of A and the length of Bi is smaller than the length of Bi. Indeed, 
A is obtained from A by removing sectors of 11 and two hubs and inserting N sectors 
of n. Therefore area(A) < area(A) — 2; the band Bi is obtained from the band Bi by 
removing two hubs, so the \Bi\ = \Bi\ — 2. This contradicts the assumption that A is a 
quasi-sector for W which either has the smallest possible area among all quasisectors for 
W or has the smallest length of Bi among all quasi-sectors for W. This comtradiction 
proves part 1 of the lemma. 

In order to prove part 2 let A be a quasi-sector for W. Then reducing the number 
of hubs of Bi as above we can transform A into a sector A with W as the label of top 



path. By Proposition ^3] we can assume that A is a computational sector. Notice that 
our transformations do not affect B2- Therefore the label of bot(;B2) is the history word 
of the computation corresponding to the sector A. □ 

The next proposition shows that the group Gat (5) simulates the machine S and pro- 
vides a lower bound for the Dehn function of this group. 

Proposition 12.1 1. For every admissible word W of the machine S there exists a 
computation of S connecting W and Wq if and only if K{W) = 1 in the group Gn{S). 
2. The Dehn function ofG^lS) is bounded below by a function equivalent to T(n)^. 

Proof. If there exists a computation of S connecting W and Wq then the computa- 
tional disc corresponding to this computation has boundary label K(W), so K(W) = 1 
in the group Gn{S). 

Conversely suppose that K{W) = 1 in Gn{S). Then there exists a van Kampen 
diagram A over the presentation Vn{S) with boundary label K{W). Our goal is to prove 
that there exists a computation connecting W and Wo and to estimate the area and 
diameter of A from below. Thus we assume that the area of A is minimal. Then A is 
reduced. Let Cj, i = l,...,4iV be /tf^-edges on 5(A), we assume that ei precedes 62,..., 
precedes e^jy on 9(A). Let /Cj, i = 1, 2N be the maximal Kj-band starting on e^. Since 
the contour of A contains two ni edges and two K2-edges, /Ci and /C2 intersect. Therefore 
A contains hubs. 

Remark. Now it is tempting to use the graph theoretic lemmas from the previous 
section, to deduce that A contains just one hub, and then prove that A is a disc. But 
unfortunately we cannot use the lemmas from the previous section because we cannot 
assume that A has minimal number of hubs: this assumption may contradict the minimal 
area assumption or the minimal diameter assumption. Thus our proof proceeds in a 
different direction. 
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Notice that for every i from 1 to 2N, A contains only two mutually inverse Kj-bands. 
Therefore each hub in A belongs to all /Cj. 

Let 7ii,...,7irn bc all the hubs in A, numbered along JCi. Let 7r„ be the first of these 
hubs appearing in /C2. Consider the subdiagram F of A bounded by /Ci, d{A), IC2 and 
(9(7r„) such that 7r„ ^ F. It is easy to see that F is a quasi-sector for W. 

By Lemma [12. 1| there exists a computational sector S with top path labelled by K1WK2 
and the bottom path labelled by K1W0K2 such that area(E) < area(F). By Proposition 



9T1| there exists a computation of S connecting W and Wq. This proves statement 1 of 
our proposition. 

Let S{n) be the space function of the Turing machine M. We know that S{n) is 



equivalent to T{n). Using the notation from Proposition ^TT] let us assume that W = a{c) 
where c is an accepted configuration of M of length < n such that any smallest space 
accepting computation of M for the configuration c has space S{n). Then by Proposition 
[4.1| , any computation of S connecting W and Wq has area exceeding CS'^{n) and time 
exceeding C'S^{n) for some positive constants C, C . Therefore by Proposition |9.1| the 



area of S exceeds 0{S'^{n)) and the diameter exceeds 0{S^{n)). 

Since area(A) > area(F) > area(S), the area of A exceeds 0{S{nY). Therefore S{n)^ 
is equivalent to a lower bound for the Dehn function of G]^{S). Since S^{n) is equivalent 
to T{nY, T{nY is also equivalent to a lower bound of the Dehn function of Gn{S). □ 

The next proposition provides a lower bound for isodiametric functions of Gn{S). 

Proposition 12.2 Every isodiametric function ofG]^{S) is hounded below by a func- 
tion equivalent to T{nY. 

Proof. Let be the "time function" of the machine 5, that is T{n) is the smallest 
number such that for every admissible word W of length < n, for which there exists a 
computation connecting it with Wq, the smallest length of this computation is < T{n). 
By the choice of iS, T{n) is equivalent to T(n)^. 

Let us fix a number n > 1 and let W be an adsmissible word for the S'-machine iS, 
\W\ <n. Suppose that there exists a computation connecting W and Wq and the smallest 
length of this computation is t{n). Then there exist a diagram (for example, computational 
disc) over Vn{S) with boundary label K{W). Let A be any (not necessarily reduced) 
diagram over the presentation Vn{<S) with boundary label K(W). Assume that A has 
the smallest diameter among all such diagrams. 



By Lemma ^]T] the operation of removing K-annuli does not increase the diameter of 
A. So we can assume that A does not contain K-annuli. This implies as before that 
every hub in A belongs to every K-band and that A contains 2 mutually inverse maximal 
Kj-bands for every i = 1, 2N. As in the proof of Proposition |12.2| , let Cj, z = 1, 4A^ 
be ftf^-edges on 9(A), we assume that ei precedes 62,..., precedes on d{A). Let /Cj, 
i = 1, 2A^ be the maximal Kj-band starting on Cj. Since the contour of A contains two 
Ki edges and two fi;2-edges, /Ci and /C2 intersect. Therefore A contains hubs. 

Let 7ri,...,TTm be all the hubs in A, numbered along /Ci. Let 7r„ be the first of these 
hubs appearing in K.2- As before, consider the subdiagram F of A bounded by )Ci, d{A), 
/C2 and (9(7r„) such that 7r„ ^ F. 
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Let Bi (resp. B2) be the Ki-band (resp. «;2-band) of F starting on the contour of A. 
Let fi be the end edge of Bi {i = 1, 2). Let v be the initial vertex of the «;2-edge /2 and 
let p be the shortest path connecting v with the boundary of A. 

In every pair of mutually inverse maximal 0-bands in A, call one band positive and the 
other one negative. Consider the set of all positive maximal ©-bands of A intersecting 
62- At most 1 5(A) I of these bands start and end on the boundary of A. Therefore 

— I (5(A) I bands in fi are O-annuli. Our goal is to calculate the number of O-annuli 
in fi which contain v in their inside diagrams. It is clear that the length of p cannot be 
smaller than this number. 

Notice that given A, F, B2, v and \p\ are determined uniquely. Without loss of 
generahty we assume that A has the smallest possible pair (|p|, IB2I) (in the lexicographic 
order) among all diagrams with boundary label K{W). 

Suppose that p contains at least two K^^-edges of 82- We shall show that this leads 
to a contradiction. 

Then p contains two edges Ci and 62 which are common edges of B2 or their inverses 
and such that between ei and 62, p does not intersect the median of 82- Let ti be the 
initial vertex of ei and let t2 be one of the vertices of 62 which belong to the same side 
of B2 (top ot bottom) as ti. Then there exists a reduced path q which connects ti and ^2 
and is a subpath of top(i32)^^ or hot{B2)^^. 

Claim. Lah{q) is a reduced word. 

Indeed, suppose that q contains a subpath ee' where e and e' are edges with mutually 
inverse ©"^^-labels. Then the corresponding cells pi and p2 of B2 form a reducible pair. 
The common edge of this pair does not belong to p"^^ because p does not have fi;2-edges 
from B2 between ei and 62. Therefore reducing this pair does not affect the length of p 
(if this edge belonged to p then reducing this pair of cells would make the shortest path 
connecting v and d{A) longer). When we reduce pi and p2, we get a diagram with smaller 
pair (|p|, \B2\) which contradicts our choice of A. 

Now let p' be the portion of p which connects ti and ^2- Then {q~^p')^^ bounds a 
subdiagram of A. Therefore Lab(p') = Lab(g) in Gn{S). There exists a homomorphism 
from Gn{<S) to the free group generated by which takes all non-O-letters to 1. Therefore 
if we remove all non-0-letters from p', we get a word U which is freely equal to Lab(g). 
Since by Claim Lab{q) is reduced, \U\ > \q\. Notice that Lab(p') contains a non-6-edge 
ei. Therefore \p'\ is strictly greater than \q\. Therefore we can substitute p' by g in p and 
get a shorter path connecting v with 9(A), a contradiction with the choice of p. This 
contradiction shows that p contain at most one ^2 ^-edge of 82- 

Suppose that p contains one Kf^-edge e oi 62- One of the vertices v' of e belongs to 
the same side (top or bottom) of B2 as v. Let q be the subpath of bot(i32)^^ connecting 
V and v'. As in the proof of Claim one can show that Lab(g) is a reduced word and that 
the part p' of the path p connecting and v' is not shorter than q. Let v be the end 
vertex of the K2 edge /2 and let v' be the other vertex of e. Then the vertices v and v are 
connected by a subpath q of top(B2)~^. Notice that as before we can assume that the 
word Lab(g) is reduced. So the lengths of q and q are equal. Thus we can substitute the 
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subpath p' of p by f2q and get another path, say, p, which connects v with the boundary 
of A, has length not bigger than the length of p, and contains only one common fi;2-edge 
with B2, the edge f2- Thus we can assume without loss of generality that the path p has 
these properties. 

Now if B2 contains reducible pairs of cells we can cancel them without affecting the 
length of p. 

In the case when p does not have common K^^-edges with B2, we can reduce B2 without 
changing the length of p. 

Thus we can assume that B2 is reduced. Now we can apply Lemma |7]5| and deduce 
that no 6-band from Q can have two intersections with B2 (otherwise B2 would not be 
reduced). Thus the number of 0-bands in f2 is |i32|. 

Every G-annulus in Q must contain v in its inner subdiagram. Indeed, the bottom 
path of B2 crosses this annulus exactly once, so the index of the median of this annulus 
(which is by definition a simple closed curve) with respect to the point f is 1. 

Therefore the number of annuli in Q is at least |i32| ~ |f^(^)l- Let us reduce the diagram 
A by first reducing the band Bi and then reducing other reducible pairs of cells. Let T 
be the reduced diagram obtained as a result of this process. Then F is a quasi-sector. By 
Lemma 3^ the band B2 does not change during the reduction process in F. Therefore 
by Lemma |12.1| Lab(i32) is a history of a computation connecting W with Wo- Therefore 
by \B2\ > T{n). Therefore the number of B-annuli in A containing v is not smaller than 
T {n) - \K{W) I > T(n) - ANn - AN. Hence the length of p is at least T {n) - ANn - AN. 
Since the length of d{A) does not exceed ANn + AN, we deduce that the isodiametric 
function of GjviS) is y T{n) = T{nf. □ 



13 Proof of Theorem 1.3 



The proof of Theorem |1.3| can be completed as follows. Let L C X+ be a language rec- 
ognized by a Turing machine with time function T{n). Then there exists a symmetric 
nondeterministic Turing machine M, with all the properties of Lemma which recog- 
nizes the language L. Take any > 6 and construct the group Giy{S). Let D{n) be 
the Dehn function of this group. Then by Proposition p.Ll| , D{n) < CiT(C2n)^, for some 
constants Ci and C2. By Proposition |12.1| , D{n) > ciT{c2ny, for some positive constants 
Ci,C2. Therefore D{n) is equivalent to T{n)^. 

Similarly we can prove that the smallest isodiametric function of A is equivalent to 
T{nf. 

Let w be an input word for the machine M and Cw be the corresponding input config- 
uration of the machine M. Then by Proposition |12.1| the correspondence w — ^ K{a{cw)) 
satisfies the conditions of Theorem pT3| , in particular w G L if and only if K{a{cw)) = 1 



in Gi\i{S). Theorem L3 is proved. 



Theorem |T]^ follows from Theorem |1.3| and Theorem |1 . 1| . 
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14 Proof of Corollary 1.2 



This corollary almost immediately follows from Theorem |1.3| and its proof. Take any 
Turing machine M which accepts a non-recursive language. We can assume that M 



satisfies the conditions of Lemma |3.1| . Using Proposition [4.1| we construct the S'-machine 
S = iS(Af) for which there is no algorithm deciding whether there exists a computation 
of S connecting a given admissible word W with the fixed admissible word Wo- Now take 
> 6 and consider the group G'j^{S) given by the presentation V'^{M) which is obtained 
from Vn{M) by removing the hub relations. 

In the group G'^{S) a word K{W) where W is an admissible word for S is conjugate 
with K{Wo) if and only if W is connected with Wq by a computation of S. Indeed, if 
such a computation exists then by Proposition |10.1| there exists a disc with boundary 
label K{W). By removing the hub from this disc we obtain an annular diagram over the 
presentation Pg(M) which shows that K{W) is conjugate to K{Wo) (see ||2^). On the 
other hand if K{W) is conjugate to K{Wo) then there exists an annular diagram over 
V'pf[S) with boundary labels K{W) (the outer contour) and K{Wq) (the inner contour). 
We can glue the hub in the hole of this annular diagram and obtain a diagram with one 
hub over Vn{S) which by definition is a disc. By Proposition |10.1| then W is connected 
with Wq by a computation of S. 

Since the language accepted by M is not recursive, G'^{S) has undecidable conjugacy 
problem. It remains to notice that by Lemma the Dehn function of G'g{M) is at 
most 0{7r') (indeed, the van Kampen diagrams over V'^{S) are precisely the van Kampen 
diagrams over Vn{,S) without hubs). 

Notice also that the group we constructed has linear isodiametric function by Lemma 
|0. □ 
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